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1 Introduction 

1.1 Compactification of Moduli of Instantons 

This paper consists of two parts. The purpose of part I is to intro- 
duce a smooth compactification of moduli of instantons. Here, we say a 
space is smooth if it is a smooth orbifold. It is well known that moduli 
spaces of instantons play very important roles in the study of differen- 
tial four-manifolds. The most famous example is the Donaldson theory 
(H). In general, the moduli spaces are not compact. One of the fun- 
damental problems is to give a compactification. The most commonly 
used compactification is the so-called Uhlenbeck compactification (||, 
H). The Uhlenbeck compactification works well for many purposes in 
the Donaldson theory. But the Uhlenbeck type compactification is less 
successful in tackling the problems in which studies of lower strata are 
involved. Examples are the Kotschick-Morgan conjecture (p|, p|) on 



the wall-crossing formula and the Witten conjecture (fl22[|, ffji]]) on the 
relationship between the Donaldson invariants and the Seiberg- Witten 
invariants. The problem with the Uhlenbeck compactification is that 
the spaces are only statified spaces and very singular in lower strata. 
Although the neighborhoods of lower strata can be described (|2l|) and 



some applications are worked out [ IS | ) , they are intractable in gen- 
eral. For example, the Kotschick-Morgan conjecture has been around 
for almost ten years and a great deal of efforts has been made to solve 
it. So far, it remains unsolved. The author believes that the most im- 
portant reason for the lack of closure is the fact that the Uhlenbeck 
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compactification is not good enough. 

For anyone who knows algebraic geometry, it is not difficult to ap- 
preciate the importance of a good compactification, which is often the 
make-or-break point of a problem. A tremendous amount of litera- 
ture has been devoted to construct a good compactification in various 
geometric problems. An important notion emerging from these stud- 
ies is the notion of "stability". Let us take the example of quantum 
cohomologies, which is similar to the gauge theory in many ways. A 
pseudo-holomorphic map from a Riemann surface (smooth or singular) 
has a natural automorphism given by the automorphism of Riemann 
surface. Such a pseudo-holomorphic map is called "stable" if the stabi- 
lizer is finite. Once the stability is achieved, the moduli space behaves as 
good as a smooth orbifold via so-called "virtual cycle" or "virtual inte- 
gration" techniques. The moduli space of stable maps was constructed 
by Parker- Wolfson-Ye-Kontsevich and has been proved to be extremely 
important for quantum cohomologies. Indeed, the computations in the 
theory of quantum cohomologies has gone very far based on the moduli 
spaces of stable maps. Using a similar analysis, Parker- Wolfson ( fl9| ) 
pointed out a similar compactification should work for moduli spaces 
of instantons. We refer these type compactification as the bubble tree 
compactification. The author was pointed out by T. Mrowka that the 
bubble tree compactifcation had been known by many people in 80's. 

However, there are instances where the stability fails when the bub- 
ble tree compactification is used for the moduli of instanton. Even in 
the generic situation, the corresponding moduli spaces will have singu- 
larities. As far as the author knows, there is no general method to deal 
with the singularities caused by the failure of stability. Unfortunately, 
a close examination of bubble tree compactification shows that the sta- 
bility fails at "ghost strata" (see §3.2 ). This is very disappointed! To 
search a smooth and reasonible compactification, a new idea is needed! 

The main purpose of this paper is to introduce a method to resolve 
these singularities. Technically, it is similar to the "flip" in algebraic 
geometry where we first perform a blow-up and then a blow-down. It 
is easy to do a blow-up to obtain a manifold with boundary. The key is 
that a blow-down can follow to obtain a closed space again. Topolog- 
ically, it is equivalent to the well-known interchange of handles. Such 
a blow-down follows from our key observation that "ghost strata" are 
closely related to the Fulton-McPherson compactification (]ll|) of con- 
figuration of points. This main technique is given in §4. 
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1.2 Wall-Crossing Formula for 4-manifolds with b + = 1 

In §5, we give an affirmative answer to Kotschick-Morgan conjecture. 
Our new compactifation is one of our main ingredients. With this 
smooth moduli space, we are able to apply the localization technique 
which is our other ingredient to solve the problem. The idea can be also 
applied to the moduli of ?7(2)-monopoles, which is more complicated 
than the case considered here(0]): we also have a new type compactifi- 
cation for this moduli space; with this space, the "virtual" localization 
technique and the method developed in §5, this leads to an alternative 
proof of the Witten conjecture mentioned earlier with certain condition 
(1). 

Let X be a smooth, simply connected, closed compact 4-manifold. 
Denote by b + the maximal dimension of a maximal subspace of H 2 (X, M) 
on which the cup product is positive definite. Let P be an S'0(3)-bundlc 
over X. It is well known that the Donaldson invariants (or Donaldson 
polynomials) based on the moduli space A4p(X), are well defined when 
b + (X) > 1. Moreover, when b + > 2 they are unique in the sense that 
the invariants are independent of metrics as long as they are generic. 
For b + = 1, the Donaldson invariants depend on the metric. In fact, the 
invariants depend on chambers in H 2 (X, R) determined by P. Let Ap 
be the set of chambers. Donaldson invariants are expressed as a map 

*£ c : A P ^ Sym d (H 2 (X,Z)), 

where c G H 2 (X,7j) is an integral lifting of W2(P)- \&p c (C) is the 
Donaldson polynomial in terms of the chamber C. The invariants vary 
with the chambers according to a wall-crossing formula. Let C\, C_i be 
two chambers. Suppose A is the collection of walls between them. Each 
wall corresponds to an element of H 2 (X,7*). So we can identify A as a 
subset of H 2 (X,Z). The following is conjectured [16] 



Conjecture (Kotschick-Morgan): Suppose 

*£ c (Ci)-*£ )C (C_i) = ^e(c,a)5 P (a). 
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where e(c, a) = (c — a) /2, then 

r 

5 P (a) ^ n, !>.,/.. \>/ r 'o'' 
i=0 
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where q is the intersection form of X and the coefficients <Zj depend only 
on r, d and homotopy type of X. 

Assuming the conjecture, Gottsche has derived an elegant formula 
for 5p(a) in terms of modular forms. So the remaining question is the 
conjecture itself. The core of the problem is to prove the dependence on 
the "homotopy type" of X. For example, we show in theorem 5.12 that 
Oj's depend on the Euler number \ an d signature a of X which are well 
known homotopy invariants. One of the main ingredients is our smooth 
compactification. 5(a) can be computed via a localization formula of 
the equivariant theory for models with S^-action. But when we use the 
Uhlenbeck compactification, these models become very singular because 
of the structure of lower strata. In practice, computations are unavail- 
able. It turns out that our smooth compactification fits the equivariant 
structure perfectly. This enables us to apply the ordinary localization 
theory. Beginning with this observation, we reduce the proof of the con- 
jecture to computing p\ of some bundle over a space, which itself is a 
fibration over X with fiber M b K . Fortunately, this problem can also be 
dealt with by using the equivariant theory again. Therefore, we prove 
the conjecture. Details are given in §5.3. 
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Updates: This is an updated version of the author's thesis (0]). Re- 
cemtly, Feehan-Leness (f8[) announced a proof to the Witten conjecture 
along their cobordism approach. In the same paper, they also mentioned 
that their approach can also work for the Kotschick-Morgan conjecture. 
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2 Background 



We give a quick review of some basic facts about instantons and their 
moduli spaces for four-manifolds. Readers are referred to Q for most 
of the details. 

Let (X,g) be a simply connected 4-manifold with metric g. There 
are two kinds of manifolds considered in this paper. One is manifolds 
with b\ > 0; the other is S 4 with standard go . For S 4 , using the 
stereographic projection s from the south pole to the equatorial plane, 
we identify s : S 4 \ {south pole} — > R 4 . Alternatively, and oo are used 
for the north and south poles of S 4 respectively. Via s we pull back the 
standard coordinate functions on R 4 to functions Xj, 1 < i < 4, on S 4 . 

Suppose that P is a principal G-bundle over X, where G is either 
£0(3) or SU{2). When G = SU(2), let E be the associated rank 2 
vector bundle of P and K = 02(E). When G = SO (3), choose c G 
H 2 (X, Z) to be an integral lifting of 102 (P). There exists a f7(2)-bundle 
£ such that p\(E) = Pi(P) and c\(E) = c. We say that E is associated 
to (P,c). Let no be the biggest integer satisfying d(no) > and uq = 
Pi(P)(mod4). Set = (c 2 - n )/4 and let K = c 2 {E) - 0. Now fix a 
connection on the determinant line bundle det(E'). For G = SU(2), 
choose ciE = 0. For a connection A of E we write det(A) to be the 
induced connection on &et(E). For both cases define 

A E = {A\det(A) = a E }- 

Qe, the Gauge group of E, consists of gauge transformations of E pre- 
serving det(E). Define Be = Ae/Ge- Be is called the configuration 
space. Let A* E C Ae be the set of non-reducible connections and 
B* E = A* e /Qe- All definitions can be given directly by using P instead 
of E. A connection A is called anti- self- dual (or ASD) if 

F+(A) = 0. 

Class [A] € Be is called an (anti-)instanton. Me{X) is the collection 
of ASD-connections of E. The moduli space of instantons is 

M E {X)=M E {X)/G. 

Let qe be the adjoint bundle of E. There is a complex 

- n\ 9E ) ^ ^(qe) ^ n + ( 9E ) -> 
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for any ASD-connection A. The complex gives the Kuranishi model for 
the "virtual" tangent space of Me(X) at [A]. Let H A , % = 0, 1, 2 be the 
cohomologies defined by this complex. We say that [A] is regular if Ha = 
H\ = and semi-regular if H\ = 0. We always assume that H\ = in 
this paper. When H A ^ 0, A is called reducible, otherwise A is called 
irreducible. A4* E (X) is the moduli space of irreducible instantons. [A] 
is regular iff it is irreducible. When [A] is regular, H A is the tangent 
space of [A] in the moduli space. When [A] is semi-regular, first of all, 
[A] is reducible: its isotropy group Ta is either S 1 or G; secondly the 
local model of [A] in the moduli space is then given by H a /Ta- When 
h% (X) > and 02(E) 7^ 0, the transversality theorem (||, ||) says 
that A4e(X) = A4* E (X) is smooth for generic metrics g. For X = S 4 
with standard metric go, instantons are regular if 02(E) ^ 0. When 
02(E) = 0, the trivial instanton is semi-regular and its isotropy group is 
SO(3). For general X, when G is SU(2) and 02(E) = 0, H 2 is nontrivial 
for the trivial connections. We exclude this case in our paper. In general, 
in order to consider the trivial connection, one should introduce the 
"thicked moduli space" ( , ||io|| ) . For generic metrics, the dimension of 
the moduli space is computed by the Atiyah-Singer index theory: 

dimM E (X) = d( Pl (E)), 

where d is defined by 

d(n) = -2n-^( X + a), (1) 

where x 1S the Euler number of X and a is the signature of X. 

One would see that when X = S 4 , the dimensions are 5(mod8). In 
fact when X = S 4 , we are more interested in a modified moduli spaces. 
These are the original moduli spaces modulo a 5-dimensional group 
H. H is a subgroup of the conformal group of S" 4 . It is generated by 
translations and dilations on R 4 . Any conformal transformation induces 
an action on moduli spaces. So we can define M E = M.e(S 4 ) / H . 
Alternatively we can describe this moduli space more explicitly: let h 
be a constant less than An 2 . Define M b K to be the collection of ASD- 
connections A with the properties 

/ x\F(A)\ 2 = and / \F(A)\ 2 = h. (2) 
Jr 4 Jb(i) 

The first equation requires that the mass center of A is and the second 
one requires the energy in unit disk B(l) to be equal to a small constant. 
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We call (2) the balanced condition. An ASD-connection A is balanced if 
it satisfies (2). Then M b E = M b E (S A )/g E , where M b E {S 4 ) is the set of 
balanced ASD-connections. 

Fix a point xq € X. When X = 5 4 , we always choose xo = oo. 
Suppose is the subgroup of Qe that consists of gauge transformations 
with identity at point xq. Then 

u : A* E /g° E - 

is an 50 (3)-bundle. We denote the bundle by P^ when G = SU(2), 
and by P^ c when G = 50(3). If no confusion is caused, we just write 
it as P°. Alternatively, this bundle can be given by 

u : A* E xg E P xo -» B* E . 

Let M%{X) = u~ 1 {Me{X)). There is another important 50(3)- 
bundle over B* x X which plays a crucial role in the Donaldson theory. 
This bundle is defined by 

V : A* E Xg E P - B* E X X. 

We denote this bundle by P# or Px jC depending on G. In general, we 
write it as P. Similarly, M EX = v- 1 (M E {X) x X). Using pi(P), one 
defines 

fi:H4X;Z)^H 4 -*(B* E ;Q) 
by taking the slant product //([—]) = —\pi(^)/[—]- F° r the O-class [xq], 

MM) = 4pi(p°). 

As we mentioned in the introduction, the bubble tree compactifica- 
tion is our initial step towards our new compactification. It is based on 
the following compactness theorem. 

Theorem 2.1 (Parker- Wolfson) . Suppose that L4 n ]^i is a sequence 
of instantons in M.e{X). Then there exists a subsequence of [A n ] that 
converges to a bubble tree instanton [A] in Me{X). 

Definitions of M.e{X) and bubble tree instantons are given in §3.1. 
Suppose P,E,K are as above. Let E_i,K > I > 0, be the bun- 
dles such that c\{E-i) = c\{E) and c%(E-i) = c 2 (E) — I. Roughly 
speaking, a bubble tree instanton [A] is a sequence of finite instantons 
([Aq], [Ai], . . . , [Ak]) on components: X and 4-spheres Sf, 1 < i < k. X 
is called the principal component and 5 4 's are called bubbles. We call 
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[Aq] the background instanton of [A]. [Aq] should be in A4e_ 1 (X) for 
some I. Then we say that [A] is in the l-level stratum. Suppose X ^ S . 
If [Aq] is trivial, we say that [A] is trivial. If [Aq] is reducible, we say [A] 
is reducible. Let F^; be the collection of trivial bubble tree instantons. 
In this paper, we assume that = 0. For example, when G = 50(3) 
and W2(P) 7^ 0, Fe = (fll5|). Essentially, this is the case that we are 
interested in. To avoid the confusion caused by the complexity of nota- 
tions, we restrict to G = SU(2) and make the assumption that F^; = 0. 
Also note that E is determined by K up to bundle isomorphisms. We 
replace the subscript by K in general. For example, Mk{X) is 
the same as Me(X) from now on. If we consider G = 50(3), then the 
subscript should be K, c. 

In the next section , we will define A4k(X) and its topology, then 
study its topological and smooth structure using the gluing theory. 

We now review the problem of the wall-crossing formula. In the rest 
of the chapter, assume (X) = 1 and P — > X is an 50(3) principal 
bundle with W2(P) ^ 0. Let c be an integer lift of w 2 {P) and £ be a 
f/(2)-bundle over X with a(E) = c,pt(E) = pi(P). Set K = c 2 {E) - 
as before. Suppose gx is a generic metric. Let M K (X, gx),Mi<:(X, gx) 
and Ai K (X, gx) be the compactified spaces of Aix(X, gx) in the sense 
of the Uhlenbeck compactification, the bubble tree compactification and 
the smooth compactification which is given later in §4. The map \i : 
H 2 (X,Z) -> H 2 (M K {X,g x ),Q) defines 2-classes /i(E),S € H 2 (X). 
//(E) can be extended to /u(S) over M K (X, gx) (@, flO|l ). There are 
two different approaches: in | a line bundle Cy, over M. K (X, gx) is 
constructed and /2(S) is the c\ of the bundle; in [IQ] the proof is more 
algebraic-topological. Here we follow the approach of ||. Note that 
there are natural projections from both Mk(X, gx) and Ai K (X, gx) 
to A4 K (X,gx)- So /2(S) is also well defined over these two spaces via 
pull-back maps. The Donaldson invariants are defined to be the pairing 
of compactified moduli space with cohomology classes given by [i. One 
can show that the Donaldson invariants are independent of compactified 
spaces we choose. However, when b^(X) = 1, the pairing is no longer 
metric independent: let g_i,gi be two generic metrics and A = gt be a 
path of metrics connecting them. Define 

M K (X,X) :={([A],t)\[A]GM K (X,g t )}. 

This forms a "cobordism" between A4k(X, g^), i = —1,1. Unfortu- 
nately, it is not an actual cobordism since there may be reducible con- 
nections in the space. We can still apply compactification theories to 
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this space. Then M K (X, A) forms a "cobordism" between M K (X, gi), 
and so is true for other two different compactifications. are not 

well defined wherever reducible connections appear. Even worse, re- 
ducible connections can be at lower strata. Suppose A is a reducible 
connection on the top stratum. Then it gives a reduction of P to an 
^-bundle Q a , P = Q a x 5 i SO(3). Here d{Q a ) = a,a € H 2 (X,Z). 
We say that A is reducible with respect to a. Let L a be the line bundle 
associated to Q a . Correspondingly, E splits as 

E = L 1 a / 2 ®L 1 c / 2 ®L- 1 / 2 ®L 1 J 2 . 

If A € M.K(X,g), F4 = 0. Suppose that oj(g) is the self-dual harmonic 
2-form with respect to g. Then a ■ uj(g) = 0. In general, the reducible 
connection [A] with respect to a is in M K (X,g) if and only if 

Lo(g) G W a := {x € H 2 (X,R)\x 2 > 0,x • a = 0}. 

where H 2 (X, R) is identified with the space of harmonic forms. We call 
W a the wall associated to a. W a is called a wall of P-type if W2(P) is 
the reduction of a modulo 2 and > a 2 > p±(P). The set of all walls 
W a of P-type is called P- walls. The set of chambers Ap of X is the set 
of components of the complement, in the positive cone of H 2 (X, R), of 
the P-walls. The Donaldson invariants associated to a bundle P depend 
on the chamber where the period point of the metric is located. The 
invariants vary when period points of metrics pass through P-walls. Let 
d = -pi(P) - 3. : A P -» Sym d (H 2 (X,Z)) is the map such that 

<3?p c (C) defines the Donaldson invariants for the chamber C. 

We now state the Kotschick-Morgan conjecture in terms of the Uh- 
lenbeck compactification. Suppose [A] £ M. K (X : X) is a reducible con- 
nection with respect to a. Let r = (a 2 — pi(P))/4. Then the family 
of reducible connections is [A] x Sym r (X). Let denote the link of 

the family of reducible connections. This defines an element 6p(a) in 
Sym d {H 2 (X,Z)) as follows: let z G Sym d (H 2 (X,Z)), 

S P (a)(z) = {fl(z),D(a)}. 

If C_i and C\ are chambers, then 

<&£ c (Ci) - *£ iC (C_i) = ^6(c,a)<5 P (a). 

where e(c,a) = (-1)( C " Q ) 2 / 2 . 
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Conjecture 2.2 ([|l6[]). With notations as above, 

r 

5 K (a) = J2^(r,d,X)q r - l a d - 2r - 2 \ 
i=0 

where q is the intersection form of X and the coefficients ai depend only 
on r,d and homotopy type of X. 

3 Gluing Theory, The Topological and Smooth 
Structures of Moduli Space M.r :(X) 

Before we resolve the singularities of Mk(X), we first have to under- 
stand its smooth structure and the structure of singularities. The chap- 
ter is organized as follows: in §3.1, we define the bubble tree compact- 
ified space M.k{X); The topological structures and the smooth struc- 
tures are discussed in §3.2 and §3.3 respectively by using the gluing 
theory. 

3.1 Bubble Trees and Strata 

Let Mk(X), M h K be as before. We denote their compactified spaces by 
M-k(X), Ai K - They are stratified spaces. We begin with the descrip- 
tion of their strata. 

Each stratum is associated with a connected tree with additional 
requirements. Let T = (V, D) be a tree, where V is the set of vertices 
and D is the set of edges. Choose a vertex vq G V to be the root of T. For 
each v (zV there is a unique path connecting v and vq. The length of the 
path (i.e, the cardinality of edges in the path) is called depth(v). We say 
that vertex m is an ancestor of vertex va if the unique path connecting 
Vi and Vj does not pass through vq and depth(vi) < depth(vj), and we 
also call Vj a descendant of V{. Moreover, if depth(vj) = depth(vi) + 1, 
Vi is called a parent of Vj and Vj is a child of Uj. In other words, u, is 
an ancestor of Vj and there is an edge e = (vi,Vj) in D. From now on, 
when we write an edge e = (v{ , Vj ) we mean that V{ is the parent of Vj . 
For each vertex v £ V, let V(v) be the set of vertices consisting of v 
and all descendants of v. Define t(v) = (V(v), D(v)) to be the subtree 
of T induced by V(v), i.e, D(v) consists of all edges in D connecting 
vertices in V(v). We now assign each vertex v a nonnegative integer 
w(v). w(v) is called the weight or charge of v. We call such a T to be a 
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weighted tree. Given a vertex v, define W(v) to be the sum of charges 
of all vertices in t(v). We call W{y) to be the total charge of v. Let 
child(v) be the set of all children of the vertex v. 

Definition 3.1. A weighted tree T with a prechosen root vq is a bubble 
tree if for any vertex v we have either 

• w(v) ^ 0, or 

• \child(v)\ > 2, and W{vi) > for all Vi G child(v). 

We write the bubble tree as (T, vq) or T. 

Note that for each vertex v in a bubble tree T the associated subtree 
t(v) is still a bubble tree. And t(vi) is called a bubble tree component 
of v if Vi G child{y). Given an edge e = (vi,Vj), we say that the tree 
T' = (V',D r ) is the contraction of T at edge e if V = V \ {v-j} and 
D' is the union of edges in D that do not connect Vj and new edges 
e' = (vi,vi),V[ G childivj). w(vi) is updated to w(vi) + w(vj). If T is a 
bubble tree, T 1 is also a bubble tree with the same root. For a bubble 
tree (T,vq), define the total charge W(T) of T to be W(vq). Given an 
integer K > 0, let 

T K = {(T,v )\(T,v ) is a bubble tree and W(T) = K}. 

We introduce a partial order on T^: T\ < T2 if T2 is obtained from T\ 
by a sequence of contractions. It is easy to prove 

Lemma 3.2. \Tk\ < co- 

Now we focus on X = S 4 for a moment. Suppose (T, fo) is a bubble 
tree and T = (V, D). We assign 4-spheres to each vertex Vi £ V. For 
each edge e = (vi,Vj) or € child(vi), we assign a point ^(e) = Pij £ 
S 4 4 . Moreover dr(e) / c^r(e') if e 7^ e'. Abstractly, we define a bubble 
tree space V of T to be a pair Y = (]J i S" 4 , dx(D)). Geometrically, Y is 
realized as a quotient space ]J Sf / ~: for each e = (vi, Vj), dr(e) ~ 00 j. 
We denote V by UjSf/dT- For any and edge (vi,Vj), dr(vi,Vj) is 
called a bubble point on S* 4 . For p = (pi,... ,p n ) £ (-S* 4 \ {oo}) ra we 
assign charges (or energies) to = (w\, . . . , w n ) G (Z + ) n to it, i.e, Wi is 
the charge of pi. S n , the n-permutation group, acts on p and to in the 
standard way. Define S m < S n to be the kernel of action on to and 

N m := (S 4 \ {oo}) n /S n . 

The class of p in N m is denoted by [p] = [p± , . . . , p n ] . Note that the 
group action H on S 4 \ {00} is also well defined on N m . 
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Definition 3.3. A generalized instanton on S A is an instanton [A] £ 
■Mk (S 4 ) with [p] = [pi, . . . ,p n ] G -A^, where tt) = (fei, . . . , fc ra ). Charges 
hi are called the 5- mass atpj. T/ie moduli space of generalized instanton 
denoted by A / Ifc 0jTO (5 4 ) is .M^S* 4 ) x jV m . Define 

M b k0 , M = M hhW (S 4 )/H. 

The element [[A], [p]] in M% m is called a balanced instanton. The rep- 
resentative of [[A], [p]] can be chosen to satisfy the balanced condition: 

1. the (generalized) mass center m is the north pole, namely 

m := x(m([A]))ko + x(pi)k\ H h x(p n )k n = 

2. one of the following cases holds: 

(a) the charge of [A] at south hemisphere equals the constant h 
and pi 's are located in the open north semi-sphere, 

(b) [A] is nontrivial and the charge of [A] at south hemi-sphere 
is less than h. Also pi 's are located in north hemi-sphere and 
there exists at least one pi on the equator. 

(c) [A] is trivial; then 

n 

X 2 ( P i) + X 2 ( P2 ) + ■■■ + X 2 ( Pn ) = h- (3) 

i=l 

The instanton is denoted by [[A], [p]]m- 

Example 3.4. Suppose ko = 0, p = (pi,P2) and Xo = (1, 1). Then 

M b 0;to = S 3 /Z 2 = RP 3 . 
Directly from the definition, we know that 
Lemma 3.5. A4 b k m is a smooth manifold. 

Definition 3.6. (T, vo) is a bubble tree and T = (V,D). Suppose Y 
is a bubble space of (T,vo), where Y = (]J V . GV Sf) / 'dr , and instantons 
[Ai] G M. w ^{Sf). For each Vi, suppose child(vi) = {vn, . . .vu x }. Let 

pi = (d T (vi,vn),... ,d T {vi,v ih )) 
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and 

tot = {W(vn,... ,W(v Ui )). 

Data (Y, [Ai]) Vie y generate generalized instantons [[Ai], [pi]] Wi on Sf. If 
each element [[Ai], [pi]]m is balanced, then {[[Ai], [pi]]tOi)vieV i- s called a 
bubble tree instanton. The stratum, denoted by St{S 4 ), is the set of 
such bubble tree instantons. 

Let Mt be the set of bubble spaces of some bubble tree instan- 
tons in St{S a ). Suppose Y € Mt- Note that Y can be represented 
as (JJ Sf, dx)- For each v G V, suppose child{v) = {v\,... ,v n } and 
to v = {W{vi), ... , W{v n )). The action S Wv on {d T {v, vi), ... , dr{v, v n )) 
induce an action on dx and so on Mt{S ). Define Mt{S a ) = Mt/S Mv . 
Elements of Mt{S a ) are called bubble tree manifolds. 

we make a convention on representations of bubble trees and bubble 
tree manifolds. We denote by R{T) the representation of T. We define 
it inductively: Suppose vq is the root and child(v) = {v±, ... ,Vk}- Their 
weights are Wi,0 < i < k. Then 

R{T) = [w [w! ■ R{t{v 1 )),w 2 ■ R{t{v 2 )), ... ,w k - R{t{v k ))]]. 

In this equation, • is only used to make the equation easy to read. 
Similarly, we can represent a bubble tree manifold in the same way. 
Suppose Y is a bubble tree manifold of T defined by dr- To represent 
Y is essentially to represent dr if X is given. We denote by Y{dx) the 
representation of dx- Suppose pi = dx{vo,Vi), 1 < i < i, then 

Y{d T ) = [pi ■ Y(d t(vi) ),p 2 ■ Y{d tM ), ... ,p k - Y{d t{vk) )]. 

Follow the notations, one can see that 

Lemma 3.7. Mt{S ),St{S ) are smooth manifolds. 

Proof: We prove this by induction. If T consists of one vertex, then 
the lemma follows from lemma 3.5. Else, suppose 

child{v ) = {vi, ... ,v n } and ro = (W(ui), . . . , W{v n )). 

St(S a ) is a fiber bundle over M b w r VQ \ ro and the fiber is a product of 
S t t Vi y Then St{S 4l ) is smooth if S t < v .\ are smooth. This can be finished 
inductively. The proof for the Mt(S 4 )'s is the same, q.e.d. 

Define 

M b K = U T er K S T {S 4 ). 
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Now we explain how these definitions can be modified and general- 
ized to general X. Given a manifold X and a bubble tree (T, vq), we de- 
fine bubble tree manifolds and bubble tree instantons inductively. First 
assign the root vq for the manifold X. Then for any edge ej = (vq, Vj) in 
T, define a bubble point pi := dj>(ei) G X. For all vertices in the subtree 
t(vi), the assigned spheres are TM Pi , the one-point compactification of 
the tangent space TM p . . Bubble tree instantons are the modified bubble 
tree manifolds with instantons on each component and they satisfy the 
same requirements given in definition 3.6 except the one on root X. The 
instanton on X is in A4 w r VQ \(X). Again strata are defined to be the set 
of bubble tree instantons. We denote the stratum by St(X). Mt(X) 
is defined similarly. It is not obvious that the strata defined this way 
are still smooth, since the strata are parametrized by bubble points on 
X and we define the space pointwise. However this is still true, indeed 
it is a fiber bundle over X n /S m for an appropriate ro. This can be seen 
easily from a better version of description given in §3.2. Define 

M K (X)=U T eT K S T (X). 

We now introduce a set of terminologies concerning ghost bubbles. 
Suppose T = (V, D) is a bubble tree and M is a bubble tree manifold 
of T. By a ghost vertex v we mean that w(v) = 0. Gt is the set of 
ghost vertices of T. The corresponding component in M is called a 
ghost bubble. If T contains any ghost vertex, we say T is a ghost bubble 
tree and St{X) is a ghost stratum. Let Qk C Tk be the set of all ghost 
trees. Define 

S K (X) = U Te g K S T (X). 
We call Sk(X) the singular set of Mk(X). Similarly, all definitions 
apply to M.k- 

3.2 The Gluing Theory 

To patch all strata together and give Ai a manifold (or orbifold) struc- 
ture, the standard tool is the gluing theory, which has been developed 
intensively ([P|,pl|). The basic idea is that the gluing method con- 
structs local charts for the compactified space. In this section, we go 
over the gluing theory. In addition, we consider how to "patch" glu- 
ing maps for different strata. Therefore, we are enable to conclude the 
smoothness of the whole compactified space. This is explained in §3.3 
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and necessary estimates are given in this subsection. This type issue 



was first discussed by Ruan [20] in the quantum cohomology theory. 

We are going to encounter a large collection of notations concerning 
fiber bundles in this paper. Suppose ir : X — > Y be a fiber bundle with 
fiber Z. To make the notations more suggestive, we write X = YxZ if 
no confusion is caused. 

Given a stratum St(X),T £ Tk, there is a so-called gluing param- 
eter GIt- Roughly speaking, the gluing theory is to study a gluing 
map 

f T : S T {X)xGl T -> M K (X). 

Also for any T < T' there is an associated gluing parameter GI(t,t') C 
G7 T . maps S T (X)xGl(T,T') to 5 T /(X). In the gluing construc- 
tion, a very important step is to construct approximating solutions via 
"splicing" . In particular, this is the case when we expect a global gluing. 
This step is to construct a splicing map 

: S T (X)xGl(T,T') -» Br> 

for all T <T'. 

We study the base case. Suppose bubble tree T consists of only two 
vertices vo, v and one edge e = (vq, v). The weights are k\ = w(vq), &2 = 
w(v) and let K = k± + A;2. -R(^) = [&o[&i]]- Assume fci&2 7^ 0. Bubble 
tree manifolds are given by dx{e). Suppose dx{e) = p,p G X, then 
V(dr) = \p\- So Mr is parameterized by X. To be consistent with 
notations below, let X\ = X, X2 = S 4 and p\ = p,P2 = co- 

The stratum St(X) is described as follows: Let Fr{X) be the frame 
bundle of X, P\ — > X and P2 — > <S 4 be SU(2) principal bundles with 
C2(Pi) = ki. Ei are associated vector bundles of Pj. Define 

A^(S 4 ,X) = Fr(X) x so(4) A^(S 4 ), 
M b K (S\X)=Fr(X)x so(A) M b K . 



and 
Then 



S T (X) =M kl (X) xM b k2 (S\X) 



and it is also a bundle over X. The gluing data of stratum St(X) is a 
bundle 

5/ : GL T = x Fr(X) x 50(3)xSO{4) X^°(S 4 ) x P+ ^ 5 T (X), 
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where the action SO(3) x SO (4) on elements in A4 b ^(S A ) comes from 
the action of SO (4) rotating S 4 and the action of 50(3) = SU(2)/{±1} 
on the base frame. Note that Mt is a factor of Sr(X). In this case 
Mt = X. Locally for a p € X the geometric meaning of the fiber 

gl~ x {*,p) is 

Gl(p) ■= Hom SO (3)(SE 1 {p),QE 2 (oo)) x R + . 

This is isomorphic to 50(3) x R + = R A \ {0}/Z 2 :=Gl T \ {0}, where 
GIt = R jZi- We call the parameter R + in GIt the gluing radius. Use 
GLy(r) for the set in GLr with radius < r. 

We say a set U in a space is proper if its closure in the space is 
compact. The gluing theorem is now stated as 



Theorem 3.8 (pi]), ^or any open proper set U £ St(X) there exists 



a small constant €q, depending on U, and a gluing map 

*t : GL^(eo) D gl~ l (U) -» 
suc/i i/ioi ^>t is a diffeomorphism. 

The proof consists of two parts: part 1, the local diffeomorphism of 
and part 2, the injectivity of ^t- 

Part I, local diffeomorphism of 

The idea is that can be constructed gauge equivariantly. So 
we can work on A* instead of B* . Also the local diffeomorphism is 
a local issue. We set up the local coordinates for gluing. Suppose 
that [A^],i = 1,2 are in Mk^X), M\ 2 ; U\,U\ are their neighborhoods; 
p (z X and B p (l) is a ball at p with radius 1. We identify the ball 
with the unit ball in R 4 . (Readers are referred to [21| for a careful 
explanation.) Then locally is a map from 

* T : U(e) := U x x U\ x B p (l) x SO (3) x (0,e) -» A4^(X). 

It is convenient to compare this gluing map with the one for a more 
classic model. This comparison also suggests a reasonible treatment of 
B p (l) x R + in the domain of ^t- 

We modify the domain of *$>t by: fixing a r E (0, e) and a point 
p € X, replacing M b k2 by M. k2 {X2). For this set-up, X2 can be arbitrary. 
This is the case considered in M . But eventually, we are only interested 
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in S 4 . Moreover identify S A \ {00} = TX p . Define gluing data to be the 
bundle 

GL'(p,r) = MKX,) x so(3) M° k2 (X 2 ) 

over M kuk2 (X 1 ,X 2 ) := M kl {Xi) x M k . 2 (X 2 ) and gluing map $(p,r) : 
GL'(p,r) — > A4_ft:(X) which is defined later. 

Proposition 3.9. Given proper sets U in GL'(p,r), <J>(p, r) is a local 
diffeomorphism for any r < eo- eo depends only on U. 

Locally write 

$(p,r) : C/i x C/ 2 x GZ(p) -> 

where C/j C PQ)- We prove that this map, constructed in the proof, 
is a local diffeomorphism. The proof consists of four steps. 
Step 1, splicing maps and approximate solutions: Identify B Pi (l) \ {p^} 
with the cylinder i? 1 x S 3 by mapping points the polar coordinates 
(s,9) to (logs, (9). Let X- = X{ \ B Pi {N~^r) for some large constant 
N. Identify the annulus B pi (N _1 , Nr) := B Pi {Nr) \ B^N^r) with 
the tube [logr - T, logr + T) x S 3 , where T = log AT. Denote the tube 
by C p . [logr — T, logr + T], or C Pi if no confusion is caused. Define 
the connected sum X r = X$S 4 by gluing X{ at C Pi with identification 
(t, 9) ~ (2 log r — t,9). S 4 should be naturally treated as one-point com- 
pactification of (TX) p , so we can simply assume that X r is conformal 
to X. Suppose gi are standard metrics of Xj. We choose a metric g r 
on X r with g r = migi over X[ such that 1 < m,i < 2 on the gluing area 
and equal to 1 elsewhere. The Sobolev norms L ' p , L q on X r are defined 
with respect to this metric. Also we fix p, q such that 

2 < p < 4 and q = — — . 

4 — p 

There are several cut-off functions that are frequently used: let i = 1,2, 
7i,p,r : equals for t < logr — 1 on C Pi and equals 1 for t > logr + 1. 
Moreover 

7l,p,r + 72,p,r = 1 On X r 

A,p,r : equals for i < logr — T on C Pi and equals 1 for t > logr — 1. 
Vi,p,r : equals for t < logr + T + 1 on C Pi and equals 1 for t > logr + 
2T + 1 

Note that /3i, p ,r = 1 over the support of 7i, p ,r- We drop subscripts p, r 
from these notations unless we need to address their relations to p, r. 
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Convention: in this section, all constants independent of r are denoted 
by C, Ci (or e, e^) if we require them bounded from above (below). They 
may depend on proper sets U. 

Lemma 3.10. can be chosen such that \\V(3i\\ L 4 < C(log iV) -3 / 4 . 

Proof: For any function /, ||V/||£4 is conformally invariant. We can 
construct over cylinder or R 4 . We do this on cylinder. Define 

[ t < 

P(t,6) = < 1 t> logiV 

[ i(logiV) _1 otherwise 

\\V$\\ L 4 satisfies the requirement, However (3 is not smooth. Let h(x) 
be a smooth, compact supported function over R 1 such that J h = 1. 
Define 

0(t,O) = [ f3(s,6)h(t- s)ds. 
Jr 1 

It is easy to show that (3 satisfies the estimates. $ can be obtained 
from (3 by shifting and reflection on t coordinate. So V/3, have same 
estimates as V/?. q.e.d. 

Set (5 = C(log N)~ 3 / 4 . The following two lemmas study derivatives of 
our cut-off functions with respect to r, p. 



Lemma 3.11. Let f r = 7i, r) or 77^ 



V 



and for f r = f3^ r ,m 



\S-fr\<Cr-\ 
or 



W§- r fr)\\ L * < Cr~\ 



Here the derivatives of functions on X2 is interpreted as follows: 
these functions are well defined over X r and hence they induce functions 
over X. In fact, they are supported in B p (Nr). The derivatives make 
sense when these functions are treated as functions over X. 

Proof: We use (x, 9) to denote the coordinate of points on X and 
(s,9),(t,6) for points on cylinder. Suppose f r = (3\^. By definition 
(3i : r(x, 9) = 0(logx — logr + T, 9). Here (3 is defined in lemma 3.10. A 
direct computation shows that 

— Pi, r (x,9) =r~ l J 0(logx-\ogr-t)ti(t)dt. 
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So < Cr _1 . Same argument clearly works for j±. 

Since |V/|^4 is conformally invariant, we can compute it in terms of 

cylinder coordinates. So 

V (^/M*, 0)) = r ~ l f P'(s - log r - t)h'(t)dt. 

The second inequality in the lemma for (3i iT follows from this expression. 
Of course, the proof here works for 771. 

For 72 , fa j V2 note that the derivatives are only supported in annulus 
B(N~ 1 r, Nr). The proof essentially has no difference, q.e.d. 

Lemma 3.12. Let f V)T = ^ ijPjr , p ijPjr , or r] itPtr . 

\§^f P ,r\ < Cr-\ 

and for f r @i, p ,r-> Vi,p,r 

\\V(-^f r )\\ L *<Cr-\ 

Proof: Note that 

fgA x ) = f P A\ x -l\)- 

We explain how this works for the estimates by describing one case. 
Fix a point p, suppose f V)T = 0i,p,r- Pi,p,r(x) = P(log \x\ — logr + T). 
Then = /3(log \x — q\ — log r + T). Note that the derivative is only 
supported in annulus B p (N~ 1 ,Nr). So 

\§- q \ q = P Pi, q ,r\ = WQog \x-q\- logr + r) ^ X _~ g| l| g=p < Cr~\ 

Other cases are similar to the computations in lemma 3.11. q.e.d. 
For any [Ai] G Ui, choosing representatives of the class is based on 

Lemma 3.13. Suppose E is a trivial bundle over unit ball of R 4 . Then 
there exists a representive of a connection with A r = 0, unique up to 
A — > uAu~ l for a constant gauge transformation u. Moreover 

\A(x)\ < \x\sup\Fa\- 
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This is a well known result. We skip the proof. If A is chosen in this 
way, we say that A is a r-gauge connection at 0. 

Back to the construction of approximating solutions. Suppose that 
Ai are r-gauge. Define A\ = r/iAi. Note that A\ are trivial on the gluing 
area. Then 

\A i -A' i \<Cr,\F(A' i )\<C, (4) 

and 

\\F + (A> i )\\ L2 ,\\A 1 -A>\\ L4 <Cr 2 . (5) 

For any p G Gl, one can glue bundles E{ to get a bundle E p over X r 
using p. A\ automatically give a connection A' p . In general, we write 
A = (A®, A®, po) and let a = (w\, W2, v) be a tangent vector at A . 

Proposition 3.14. A' p are approximating ASD- connections, 

\\F + (A' p )\\ LP <Cr 1+ ^. 

Proof. Note that F+(A' p ) = F+(A[) + F+(A' 2 ). 

\\F + (A[)\\ LP < WV^Wlp + Win - ^AIWlp =:h+I 2 . 

Note that the integral area is in B Pl (Cr) and 

h < ||V77i|| L 4Pi|| L9 < C||V??i|| L 4pi|| L? . 

We can prove that h < Cr 1+4 /i by using lemma 3.10 and (4). The 
estimate of I2 also follows from (4) . So 

WF+iA'JUv <Cr 1+4 ^. 

Similarly, we have estimates for ||i ?+ (yl2)||LP- q.e.d. 

Proposition 3.15. Let a = (wi, u>2, e) be as above, then for z = Wi 

\\-^F+\\ LP <Cr 1+ ^\\z\\ Ll , P . 
Proof. By direct computation, 

11^+11 < C(\\V(3z\\ LP + - (3 2 )z A A\\ LP . 
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Note that Wi are tangent vectors of a finite dimensional moduli space, 
they are bounded and C°°. Moreover they are chosen so that Wi(pi) = 0. 
So we can assume < C\x\, for x € B\(p). Now the rest of the 

proof is same as the previous proposition, q.e.d. 

The reason for using r-gauge connections is to keep the gluing process 
gauge equivariant. 

Step 2: the right inverse of d\, : We assume that H\. = 0. This is the 

case that we are concerned with in this paper: when X = S 4 , H\ = 
for all ASD-connections including trivial connections; when I / S 4 , 
it follows our assumption that the metric g is regular and Fe = (rf. 
§2.1). Therefore there are right inverses Pi 

to the operators eft . Pi is uniquely determined by its image. In partic- 
ular, we can choose by requiring that the image of Pj is perpendicular 
to the kernel of d\. in L 2 (f]L p ). So 

\\Pit\\L« < CWZWlp. 

Since the moduli spaces Ui C Mi are smooth, the finite dimensional 
proper open sets and operators d\, , Pi are smoothly parameterized by 
the manifold Mi, so we have for z = Wi 

d d 
\\-Ql P i\\ < C\\z\\ L i, P{Xi) and \\-Q- z d%\\ < C\\z\\ L x, P{Xi) . (6) 

Define Qi = PiPiji and let Q = Q\ + Qi- We have 

(7) 

The proof can be found in ||. 

Proposition 3.16. For small r there exists a right inverse P to d\ 
over X r satisfying 

||P£IU« < CMU*- (8) 

Moreover for Wi G {TMijAi 

ii^peii L i, P <cii^ii L1 . P(Xi) iieiiL,. (9) 
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Proof: By (7) , we know that dX, Q is invertible if 5 is small enough. 

p 

Letting r small, we can choose a constant N to let S small, (see lemma 

3.10). Set P = Q(dX, (8) is obvious then. By definition of Qi and 

p 

(6) 

d 

\\-Q^Qi€\\Li,P(X) < C\\z\\ L i,p( Xt )M\\LP(X)- 

Same estimate holds for Q. (9) follows from the following expression of 
the derivative of P and existing estimates for Q: 

P' = (Qid+Q)- 1 )' = Q'id+Q)- 1 + P(d + A ,Q)'(d + A ,Q)-\ 
This finishes the proof, q.e.d. 

Step 3, constructing solutions: Here we use Taubes' argument. It is 
said that all solutions are of the form A p = A' p + a, where a = P(£) for 
some £. Then equation F + {A p ) = changes to 

d\,a + a A a = -F + {A' p ). (10) 

Apply P to the equation, 

a + P(aAa) = P(-F + (A' p )). (11) 

Proposition 3.17. Let 5q > be a small constant depending on U\ x 
XJi- For small r < 8q equation (11) has unique solution with 

\\a\\ L i, P < Cr 1+Z ' q . 

Let Wi as before 

\\^a\\ L i, P <Cr 1+3 '«\\wi\\ Ll , P , (12) 

Proof: recall that ||F+(^)|| LP < Cr l+i /i. By (8), the same estimate 
holds for P{-F + (A' p )). Let 

B = {feL 1 ^(n] { ( dEp ))\\\f\\<Cr 1 ^}. 
Define a map H : L 1,p — > L 1,p by 

H(a) = -P(a A a)- P(F + (A')). 
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It is routine to show that H maps B to B and it satisfies the contraction 
mapping principle (||). This proves the first statement. 
Taking derivative on equation (11), we have 

a' = -2P(a A a') - P'(a A a) + P'(-F + (A' p )) + P((-F + (A' p )'). 

Using the estimates for P', (F + (A'))', we have bound for L l,p norms of 
the last three terms. For the first term 

\\P(a A a)\\ L i, P < C\\a A cl'Wlp < C\\a\\ L 4\\a'\\L<i 

< C||a|| L i, P ||a'|| L i, P < Cr 1+3 /<*\\a'\\ L x,v < ^\\a'\\ L i, P . 

Absorb this term on the left side, we have (12). q.e.d. 
The map $ now can be defined as 

^(Ax,A 2 ,p) = A' p + a, 

where a is solved in proposition 3.17 and it can be treated as a map 
defined on U\ x U2 x Gl. Define A%, p) = A' p to be the splicing 

map. So $ = + a. 

Corollary 3.18. For any z = Wi G TM ki (Ai) or z = v G T x (Gl) 

llda^lU^^Cr^H^'^ll^ (13) 
Namely \\da(z)\\ <gC ||d$'(z)||. 

Proof: By the definition of the metric g r of X r , it is obvious that 
C||wi|| < Hd^^Wj)!! < C||wj||. So (13) is a consequence of (12). This 
result is still true for the tangent vector v. A simple way to see this is 
to replace Mk 2 (X 2 ) by Ml 2 (X 2 ) (= Gl x Mk 2 locally) and repeat the 
gluing theory. Now Gl is embedded in M® ko as a subspace. v can be 
treated as a vector in A^° 2 and so by the same argument we have (12) 
for z = v and so (13). q.e.d. 

Since we are concerned about the property of derivatives which is a 
local computation, locally we can put Gl x U 2 together and replace it 
by U 2 C A4^ 2 from now on. This is just what we did in the proof. Note 
that d<&' ~ Id, so the corollary implies that \\da\\ < Cr 1+3 / q . 

To study the diffeomorphism issue, it is useful to give a local co- 
ordinate chart for Me- The next few pages (up to remark 3.24) is 
contributed to this issue. 
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We begin with the study of the local structure of A around Im(&). 
Locally, let W = &(U\ x U®)- Let v G W be a fixed connection. Ae 
is indentified as a vector space with vq = 0. Define a map 

y : n°( QE ) xWxnl^As 

given by 

y(€, v, v) = exp(^)(w + P V T)), 

where P v is the right inverse to . In the definition, Q°(qe) is assigned 
L 2 ' p (X r )-noim; is assigned L p (X r )-norm; V and Ae are assigned 
L^-norms. We would like to conclude that y is a diffeomorphism in 
the neighborhood of V. 

Proposition 3.19. Let Vq = (0,t>,0). Let T = -D3V be the tangent 
map. 

T(£,Z,T]) = d v £ + Z + P V T). 

Then T is an isomorphism and 

||T|| < C, ||r _1 || < c. 

Proof: ||T|| < C follows from the definition of norms. Conversely, we 
first show that 

z, r7)|| <C\\a\\, 

where a = T(£, z, rj). 
We know 

d v £, + z + P V T] = Q, 

apply dy to the equation, 

d^d v ^ + dyZ + rj = d+a. 

It is easy to have 

< C\\a\\ 
\\d£z\\ < Cr\\z\\ 
\\d+d v C\\ < Cr\\d v £l 

On the other hand, similar to the argument in §, by the fact that 
H°{A) = H°{B) = (where v = &(A,B)), we have 

\\dvt + z\\ > C(\\d v C\\ + \\z\\). 
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Combine these inequalities together 

||ry|| = \\dyOi — d^d v ^ — d^z\\ 

< CH+Crdl^ll + W) 

< C||a|| +C'r(\\d v £ + z\\) 
= C\\a\\ + C'r(\\a-r)\\). 

Reorganize the terms of ||ry||, we have 

\\v\\ < C\\a\\. 

For £, z, the estimates follow from 

\\d v £ + z\\ = ||« — PvV\\ < C||ck[| . 

To see that T _1 exists, one can use the argument of using index theory 
(|). q.e.d. 

In general, 

Proposition 3.20. For any V = (£o,v,r]a) in the e-ball oJVq, D~$y is 
isomorphic. Moreover, 

\\Dy v \\ ^cjz^ 1 !! <c, 

where e,C are independent ofr. 

The idea is to compare the operators -DJV and -C*3V by a direct 
computation. A similar computation is already given in ||. One can 
find that the difference between two operators are controlled by e. Hence 
proposition 3.19 implies this propostion. 

As a consequence, 

Corollary 3.21. y is a local diffeomorphism around W . y(W x 
gives a slice for Ae- 

By the proposition 3.17, we know that the moduli space A4e near 
W is isomorphic to W. In fact, the isomorphic map is given by 

a(w) =w + ao (cf)')- 1 ^), w G W. 

So 

Corollary 3.22. (W, a) is a local coordinate chart for Me- 
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Proof of proposition 3.9: Clearly, <£' is a local diffeomorphic map. 
Using the coordinate chart explained above, <I> itself is $>'. So is a 
local diffeomorphism. q.e.d. 

Now suppose we have a smooth map / :W — > Me C Ae- Treating 
that W as a chart of .Mb, / induces a map from / = /a -1 from Mb 
to itself. 

Proposition 3.23. Let f(x) = f(x) — x. Suppose 

11/11 <e,\\df\\<e Q , 

for some small constant €q > 0. Then f is a diffeomorphism. Here 
the norm used for Ae is L l,p . Furthermore, if fi,i = 1,2 are two such 
maps, then f2 o fi 1 is locally diffeomorphic. 

Proof: Let v G W. We may assume that W is a linear vector space. 
Otherwise, we can identify W with the tangent space W v . Assuming 
v = in W. By y, we can assume that Ae is $}{qe) x W x Under 
this identification, suppose 

j(z) = (£(z),v{z)Mz))- 

f can be interpretted as f(z) = v(z) when W is identified with Me- 
By proposition 3.19, we see that 

\\dv-I\\ < C\\df\\ < Ce . 

So when eo is chosen small such that Ceo < 0.5, dv is an isomorphism. 
Therefore / is locally diffeomorphic. q.e.d. 

Remark 3.24. The proposition 3.23 is useful for local diffeomorphism 
issue. It says that if we can control difference of two maps f\ — fi 
up to its derivatives in terms of the norm of Banach space(l), we can 
conclude the diffeomorphism on the moduli spaces. One can see that this 
argument works because we are taking particular norms on the Banach 
spaces. Roughly speaking, one of the key is that our norms are with 
respect to the metric of X r instead of X. 

We now discuss our gluing map By comparing to $ and so 

following the remark 3.24, we are able to show that it is a local diffeo- 
morphism. The main difference between *$>t an d $ is that for we 
allow the gluing radius r and the bubble point pGlto vary. In order 
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to make diffeomorphic, we know just by dimension counting that we 
have to replace Mk 2 (S A ) by M\ 2 - Locally, the map is defined to be 

^ T (A 1 ,A 2 ,p,p,r) = <S> p , r {A x ,A 2 ,p). 

This is well defined globally ( |]2l[| ). Also, by our convention, locally we 
combine (A 2 ,p) together to be an A2 £ U 2 °. Let 

Fix a point B° = (A^, A%, r ,p). Let Vi(ei) and V 2 b '°(ei) be t x neigh- 
borhood of A® in A4k 1 (X\) and -M b ^. We construct a natural smooth 
map D Ptro : V x (e x ) x V*' ^) -» M fel (*) x A4° 2 x R+ x Let 
f> : M° k2 (X 2 ) -> TWfcf be the projection map and V? = h"^^' ). For 
any balanced ASD-connection A 2 € V^'°(ei), the fiber h _1 j4 is in the 
form of A 2 (t(- — y)), where i G [1 — e, 1 + e], y G -B(e). Define 

D Pjro (Ai,A 2 (t(--y)) = (Ai,A 2 ,r *- 1 ,p + rgy). 

By the geometric meanings of t, y, r, it is clear that D Ptro ~ $p,r 
over Vi x V^e). 

Proposition 3.25. Let Sk? = ^(p,r )- is diffeomorphic over 
V\{e) x V 2 {e) when e is small. 

Proof: Let ^5 = — $p,r - We know that 3>p,r is diffeomorphic by 
proposition ?. By proposition ?. It is sufficient to show lemma ?. q.e.d. 

Lemma 3.26. < Cr^ 9 ||d*p >P0 1|. 

Proof: Let A = D^B . Suppose that z is a tangent vector at A 
and z' = dD p>ro (z). Let = W T oD pro . The difference ^>s is generated 
by two terms: 1) ^>' T — & p>ro and 2) — ® pro - When tangent vector 
z is in (TVi) A o x {TV 2 b,0 )^o, ^5 = 0. So only r-direction vectors and p- 
direction vectors are nontrivial. To simplify the notations, we consider 
them separately. The case for combinations of different directions can 
be proved by the same arguments. 

Case 1: z' = r Q J^. 

Here with factor ro, is normarlized such that \\z\\ ~ 1. The proof 
consists of two lemmas. 
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Lemma 3.27. \\d& T (z) - d& Piro (z)\\ < Cr^ q . 

Proof: Suppose A r is the path representing z at A ro = A 2 . And 

Ar = A 2 {^.). 

r 

By definition ^' T (A 1 ,A r ) = $ Ptr (Ai,A 2 ). On X 

^' T (A 1 ,A r ) = mA A i) + V2,rA- 

So 

d . ~, d d 

-^\r= ro ^T\ A U A r) = ^|r=r ^l,r^l + ^|r=r ^2,r^2 + V2,r z - 

Hence 

N*t(^) ~ d®p, ro (z)\\ L i,p < || — | r=ro 77 lir ^i|| L i, p + \\ — \ r=ro r}2,rA 2 \\ L i,v 

Using lemma 3.11, the estimate is routine, q.e.d. 
Similarly we also have 
Lemma 3.28. 

W^F+^A^AMl, <Ct^ 

W^-Q^A^Ar))^ <Cr~ l 
d ^ -i 



t ^^P(^' T (Ai,A r ))\\ L i, P < Cr 



and 



\\^s(Ai,A r )\\ Ll:P <Cr 3 /i (14) 

Proof: Most of the proofs are similar to the previous one. We explain 
the proof for J-jQ. Q = Qi + Q 2 - We prove for Q 2 . Q\ is similar but 
slightly simpler. By definition Q 2 (r) = P 2 ^ r PA r l2,r- For any £ 

d d d 

l|7^Q2(r)£|| L l, P < \\-^p2,rPA 2 72,r C\\L^P + \\/h,r ^-FXTVoSIU 1 * 

d 

+ \\(32,r Q PA 2 -Q^l2,r£\\L^v =■ h + h + ^3- 
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Estimate I\\ 

h < ||V(|;^ ir )|| L 4||P i4a 7i 2)ro e||Li.P + |^ > r|||PA 2 'yi2 1 roeilLi.P 

Here we use lemma 3.11. 
Estimate I2: 

h < iiv/3 2 , ro ii L 4||^p Ar ( 72 , ro e)ii L1 , P < ci^iiiP. 

The estimate of I3 is similar. 

For P we use the expression in proposition 3.16 and the estimates for 
Q. 

(14) is a consequence of the expression of a (see (11)) and all estimates 
we just have, q.e.d. 

Combining these two lemmas, we prove the case 1. 

CclS6 2 1 Z = T*q . 

Here d/dx is a unit vector in TM p . The factor Tq normalizes the vector 
z such that \\z\\ is some constant ~ 1. This is due to the definition of 
D Pt r ■ 

Lemma 3.29. \\d& T (z) - d& Ptro {z)\\ < CrJ /? . 

Proof: Let p(t) = p + tx be a path in M. Besides the changes of cut- 
off functions in the construction of gluing maps, there is another subtle 
change. Recall the map Q (Ai, A 2 ). We require that A\ is r-gauge 
with respect to p{t) before applying 5>' to it. Denote connections as 
M,p{t)- Set 

~ d A 

This vector only depends on A\ and p. So \z\ii, P < C. Let (Ai, A2 1) = 
D-^ (A 1 ,A 2 ,p(t)). By definition, 

& T {A U A 2<r 2 t ) = Vl, p ^ t),ro A l,p(r 2 t) + V2,p{rlt),r M- ~ tx). 

Take derivative with respect to t: 

— *t(^1> A 2,r%t) = rl{-^ril,p,r Al + m,p,r Z + ^V2,p,r A 2 ) + V2,p,r Z 
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So 

\{d& T - d<S> Pjro )z\ < rl\— T)l, P ,r M + Vl,p,r Z + g^V2,p,r A 2 \ 

Because of the factor Tq, we indeed get better estimates than the one 
stated in the lemma, q.e.d. 

Similarly, we also have 

Lemma 3.30. ||f ¥«(A liP(t) , A 2 )\\ L i, P < Cr 3 /«. 

Combining these two cases, we prove lemma 3.26. q.e.d. 
Part II: The gluing map is injective. 

Here For simplicity of the notation, we prove the injectivity for <I>. We 
state the result as 

Proposition 3.31. <I> is injective. 

Proof: Suppose there exists two points in Me 
B = *(Ai,A 2 ,7);B' = ^(A[, A' 2n ') 
such that [B] = [B'], i.e, there exists g € Qe 

B = gB'. 

Assume K = $'(Ai, A 2 , 7) and K' = $'(Ai, A' 2 , 7'). Without the loss 
of generality, we may assume that L^-norms of Aj, A^'s are bounded. 
By proposition 3.17, 

\\K-gK'\\ < e. 

Now, we want to find (A'/, A 2 ', 7") such that for K" = <f>'{A'{, A 2 ', 7") 
[K'\ = [K"\ and 

\\K-K"\\ < Ce. 

This can be done as following: since K, K' 's norm are small at the 
gluing area, so is \\dg\\. Now fix a point xo in the gluing area, for 
example, \xq — p\ = ro- Let go = g(xo). we can choose gi = g over 
X\B p (r^ 2 ) and be constant go in B p {2ro). Similarly, (on the 5 4 side), 
let #2 = 9 over B p (tq) and be constant go outside B p (tq/2). One can 
show that the choice 

A'{ = 9l A[,A' 2 ' = g 2 A> 2 , 7" = g-^'go 
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satisfies the requirement. Let B" = $(A", A' 2 \ 7")- We know that [B] = 
[B"\ and 

\\B-B"\\ < Ce. 

However, by proposition 3.21, they are located in the local diffeomorphic 
region. So it provides the injectivity. q.e.d. 

Modulo some trivial consideration, the main idea of the proof for ^ 
is identical. We skip the proof here. So we complete the proof of 
proposition 3.8. q.e.d. 

Now suppose k 2 = 0. Then M b = {0}, S T (X) = M kl {X). GL T 
has a natural SO(3) action on itself. In fact, GLt = (X). The 
gluing theory does not make sense for vl/^ for this case. The reason is 
that T is not a bubble tree. But a similar result to proposition 3.9 can 
be obtained: 

Proposition 3.9': For any open proper set U C A4 kl (X) there exists 
a small constant €q, depending on U , and a gluing map 

<& : (GL'(p) n gl-\U))/SO{?,) M kl (X). 

such that $ is diffeomorphic. 

SO (3) is the isotropy group of the trivial connection A 2 . The treatment 
for nontrivial isotropy group is already given as in |J. This is the 
obstruction of constructing a smooth compactification via the bubble 
tree methods. 

We now discuss the general cases. Suppose a bubble tree (T, vq) is 
given. We describe <Sy, St(X) and define <S T '°. This is done inductively. 
The base case is that T has only one vertex vq. Then S^, = Af^/ n, 

St(X) = Ai w (v ) and 5^'° := •M.^f, Vo y We first assume that T is not a 
ghost tree. 

Case 1: S^,. 

Let v be a vertex of T. Let child{v) = {v\, . . . , v n }, and Suppose that 
Wi = w(vi). Let Vo v = (w\,... ,w n ). Suppose that for all subtrees 
(t(vi),V{), <S t 6 („.) has been constructed. Let be the sphere assigned to 
v. Then 

n 

St(v) = -^t(«),w(«) X n 5 i(fi)- 
i=l 

Let P be an S'f7(2)-principal bundle over S% with cq,{P) = w(v). Define 
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bundle 

PS = [Mlw *S P (P\oo x [J P)^M b w(v) x(St\{oc}T]/S Wv . 

i£child(v) 

Note that M b w{v);W{v) is a subspace of M b w(v) x (S* \ {oo}) n /S nv . P^ 
defines a bundle over M b w{v ^ w{v y 

Vi&child(v) 

Drop P|oo from P° and denote the bundle by P„. Then we define 

GL T = P„ x n ^ cW ^ o) 50(3) ( J] x r+ »- 

Vi£child(vo) 

Case 2: S T {X). 

Suppose child(vo) = {v±, . . . ,v n } and to vo = (W(i>i), • • • ,W(v n )). For 
each V{, we already have ^t(v)'^t(v ) m ^ * s ass ig ne( i to «o and each edge 
(vq, Vi) is assigned a bubble point pi. Tuples (pi, ... ,p n ) are parameter- 
ized by B„ := (X n \A)/S m , where A is the big diagonal. Let P — > X 
be the SU{2) principle bundle over X with C2(P) = w(vq). Define 

P t = Fr(X)x so{4) S b {Vi) ^X 

and set 

s T (x)=[m w(vq) (x)x( n p^^i n x)]/5 W „ o . 

Vi£child(vo) Vi€child(vo) 

Also, define 

n = M w (v ) *G P II (Px Fr(X)) -> M fflN) x ]J X. 

ViEchild(vo) Vi£child(vo) 
The bundle of gluing data is 

GL T = [P, X n ^ cW ^ 0) (5O(3)x5O(4)) ( II ) X ^ + ))]/^o • 

Define GIt to be the fiber of GL^. By definition of GL^, 

Gl T = ]J(R + x 50(3)), 

eeD 
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where D is the edge set of T. Namely, each edge e corresponds to a glu- 
ing parameter R + x SO (3) and we write this SO (3) to be SO(3) e . Treat 
SO (3) as SU(2)/Z 2 , we get R + x SO{3) = R A \ {0}/Z 2 . Compactify 
GLy by adding 0-section to the bundle, it is then a R 4 /Z 2 bundle. We 
denote GL^ for these compactified bundles. However, for simplicity, 
we still use GL^ instead of GL7 1 . Including 0-section or not can be 
distincted by the contexts. Correspondingly, -R 4 /Z 2 generated by the 
edge e is written as i? 4 /Z 2 . For simplicity, at this moment, we do not 
want to be bothered by the stabilizer S w 7 s. For the time being, we as- 
sume that all S n = 1. Given a vertex v in T, suppose ei,... ,e n are 
edges connecting v. Define GL(u) to be a sub-bundle of GL7 1 with fiber 
isomorphic to FJ^i? 4 /Z 2 ). 

Suppose T is a ghost bubble tree. One can still define a bundle GLy 
as above. But there are more structures on this bundle. Suppose v is 
a ghost vertex. There is an SO (3) action on the fiber of GL(u). The 
action is along fiber and it acts on each SO(3) e as group multiplications 
on suitable side. Hence, there is an S'0(3)-action on GL^ given by 
each ghost vertex. Suppose v\, ... , v\ are ghost vertices of T. Each Vi 
corresponds to an SO{3) and denoted by SO(3) Vi . Define the isotropy 
group of T 

1 

T T = \{SO{3) Vt . 

i=i 

The gluing data now is GL^/r^. For non-ghost bubble tree, define 
r T = 1. 

Suppose edge set of T is D = {ei, ... , e n }. GIt = Y\i We 
now define a map ip : GIt — > Tk- Suppose x = (xi, . . . , x n ) G GZt- We 
contract T at edges ej for those Xi 7^ and set ^(a;) to be the resultant 
tree. For any index set I C {1, , . . . , n} let x = (x±, . . . , x n ) be any 
point such that x% 7^ iff i E I. Let T' = ip(x). Define 

Gh,T> = ([[(Rf \ {0})/Z 2 ) x {0} c Gl T 

16/ 

and GLt^t' to be the sub-bundle of GLt with fiber GIt,t'- If nontrivial 
S n is also considered, Tt and GIt,t> are still well defined. Then the 
gluing theory for the general case is 

Theorem 3.32 (pill). Let (T,v ) G T K . For any proper set U C 
St{X) there is a small constant cq and gluing map 

* T : (GL T (eo)\u)/r T ^Mk(X), 
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such that maps G1it,T' AY to St> diffeomorphically for any T' > T. 
Let 

V(X,K) = {(M/ T 1 ((GL T (e )| C /)/r T ),^ T 1 )|T G T K } 

where U, eo are chosen as in the theorem. This defines an atlas for 
Mk{X) and makes it into a topological space. Moreover 

Corollary 3.33. M K (X)\S K (X) is an orbifold. 

Proof: The only thing that needs to be checked is that the transition 
maps between different charts are continuous. This is proved in §7.2.8 
H or referred to §3.3. q.e.d. 

All the constructions and results works for parallelly. 

We now have topology defined on A4k(X) via gluing. On the other 
hand, we also have the Parker- Wolfson bubble tree compactification 
theorem. It has been known that the convergence defined in bubble 
tree compactification is compatiable to the given topology on Mk(X). 
We now apply the Parker- Wolfson bubble tree compactification theorem 
to show the compactness of A4k(X) and M K . 

Theorem 3.34 (@). 1) For any sequence {L4 n ]}^i C M K (X) there 
exists a subsequence such that it converges to a bubble tree instanton in 
Mk{X) via bubble tree compactification process. 2) The bubble tree 
compactification is consistent with the topology defined by T>(X,K). 3) 

For X = S A ,M K is compact. Otherwise, if Fk = 0, M-k(X) is 
compact. 

Proof: The statement (1) is just the Parker- Wolfson theorem. The 
second statement can be proved by the gluing theorem. More precisely, 
suppose the limit A G St- Apply to U X GIt(c), where U is a 
small neighborhood of A in St- Since $>t is a diffeomorphism map, 
one can show that [A n ] is in the image of *$>t when n > N for some 
N. We skip the details. One can find the proof, for example, in |2~i"|| . 
Now we show (3). Suppose that {A n }^ 1 is a sequence in Mk(X). 
Take a positive sequence 5 n — > 0. If A n is not in M.k(X), it is in 
some chart (^ , ^ 1 (GL'p(eo)|f7), ^y 1 ). Suppose A n = ^(A^,6 n ), where 
A' n G U, b n G GIt- Choose any b' n G GIt such that \b n — b' n \ < 5 n and 
A n = ^x{A' n ,b' n ) G M.K- Now we have a sequence A n in A4k(X). 
Apply the bubble tree compactification to this new sequence and get a 
limit A G Mk- Clearly, this is also the limit of A n . q.e.d. 
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3.3 Smoothness of Mk(X) 

From the previous section, we know that Mk(X) \ Sk is a manifold. 
It is natural to ask if it is smooth. The problem is to study the glu- 
ing maps given by different strata. A straightforward way is to show 
that the transition maps between two gluing maps have certain degrees 



of smoothness. In [20], a similar situation is treated and the author 
proved that the transition maps are C . By remark ?, a result as ? 
is the key. In particular, it already provides C°°-smoothness. In this 
subsection, we change the strategy slightly here: we "patch" the gluing 
maps by perturbing one of the gluing maps. Hence the transition maps 
are relatively simple and they are smooth automatically. 

We illustrate the difficulty of the problem for the first nontrivial 
case. Let T be the bubble tree corresponding to bubble tree manifolds 

where p € X, q € Sf. GIt = R1/Z2 x R2/Z2 and the coordinate 
on it is denoted by (x^x?). For a proper open set U C St, let 
(^ , y 1 (C/xGZ'r(4eo)), ^t* 1 ) be a chart. At the mean while, there are 
two other charts intersecting this chart. Define the bubble trees Tj = 
ij T (Rf \ {0} x {0}). Set V t = Bi(e ,4e ) x {0} C Rf x {0} and 
Ui = *$>t,Ti(U x Vj). Ui C &r i are proper in their strata. These 
also provide coordinate charts (UikGlT^i)), ^y 1 )- The problem 
is whether the transition maps on the overlaps of these three charts are 
smooth. For example, consider charts 

(^ 1 ([/xG/ T (4e )),^ T 1 ) and (^(^xG^fe)),^ 1 ). 

We prove 

Proposition 3.35. There is a perturbed gluing map VPj^ of *$>Ti such 
that the transition map between charts (^^(UxGlTC^eo)),^^ 1 ) and 

{^ t ^(UixGIt(^i)),^Ti) * s smooth for small e\, where e\ depends on 
eo- 

Proof: We split the perturbation of *&T\ into two steps. 
Step 1: We know that the map \£t,Ti : UxVi — ► S^ is diffeomorphic 
by theorem 3.32. For any proper U that we consider in the gluing 
theory, when eo, depending on U, is small, we may assume that GL^ 
is a product of two bundles GL T whose fiber is Rf/1,2- Pull back GL T 
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to UxV\ and denote it by GLf^. (*r,Ti)*GLy 1 gives a trivialization 
Bt,t x f° r GLt x over ^T^iUxVi) by a proper bundle map 

Bt,t x '■ (^T,Ti)*GL^ 5l — > GL^. 

So, without the loss of generality, we can use coordinates of GL^ for 
both GLt and GL^ • Define 

^ : UxGh^eo) ^ St 1 xGI Ti ,^ 1 t (A,x 1 ,x 2 ) = (tf T (A, Xl , 0), x 2 ). 

We first construct a map that is in the intermediate stage between 
vE't and ovE^. For simplicity, we assume U consists of only one point 
A = (A\, A 2 , A3). Recall that *r(A, £1,2:2) is constructed by: splice 
A\,A 2 , A 3 together using gluing parameter x\,x 2 first, then solving for 
instanton. Formally write these two steps as 

V' T (A,x 1 ,x 2 ) =A4 xl A 2 $ X2 Az = (A^ X1 A 2 % 2 A 3 

and 

* T (A, x u x 2 ) = (A^ X1 A 2 % 2 A 3 + a((^ifc 1 A 2 )fc 2 A3), 

where A\\ X A 2 denotes the splicing of A\ and A 2 with respect to gluing 
parameter x. Now ^1 is defined to be 

*i(A,xi,x 2 ) = (Ai$ xl A 2 + a(A^ xl A 2 )% 2 A 3 

+ a((A^ Xl A 2 + a(A^ xl A 2 )% 2 A 3 ). 

The real meaning of this expression is that we glue A\, A 2 to an instan- 
ton by xi first and then glue the resultant instanton with A 3 by x 2 . Our 
first perturbation is 

Claim 1: ^1 can be perturbed to a new gluing map ^1 defined on 
[/xG7 T (4eo) such that = *i over UxGl T (2e ). 
Proof: Let 7 be a cut-off function such that ^(t) = l,t > 3eo and is 
supported in t > 2eo- Define 

^l(A,Xi,X 2 ) = [Aittxi^2 + 7(kl|M^l|)xi^2)]ltx2^3 

+ a[(A^ xl A 2 + 7(|xi|)a(AiU xl A 2 ))|tx a A3]. 

Obviously, satisfies our goal. We need to show that it is also a gluing 
map. As we did before, we compare the map with ty' T and study the 
derivative of V&i — ^' T . As usual, we can show that 

||d^-d*i|| <C{\ Xl \ + \x 2 \f/ q . 
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and 



H^A,*!,^) - & T (A,x 1 ,x 2 )\\ L i, q < C(H + \x 2 \) 1+z/q . 

This is sufficient to imply that ^ is a gluing map. q.e.d. 

Let tytp = V^i^^) -1 . We have shown that the transition map between 

charts (^ T (UxGl T (2e )) 1 ^ T 1 ) and (V^UxVi x B 2 (4e ))), (W Tl is 

T" _ 

S'tep 2: One might think that Vl/^ (or *i) is intuitively, however 
they are different because the metrics used for two gluing maps are 
different. This problem is dealt with in 

Claim 2: there exists a small constant e± > and a gluing map *$>Ti such 
thatH Tl = *Ti over ^T(Uy.(B 1 (2eo,3e ) x B 2 (ei)) and^ Tl = *Ti over 
^(Ux(B 1 (>4e )xB 2 {e 1 )). 

Proof: The difference between these two maps over ^^(C/xi?i(3eo,4eo)x 
-62(^1)) can be seen by what follows. We make a convention that the 
metric we refer to when we do comparing is the one induced by X^ Xl S 4 . 
Let B = ■^t ! t 1 (A 1 ,A 2 ,x 1 ,Q). Then 

V' Tl (*^(A 1 ,A 2 ,A3,x 1 ,x 2 )) = B$ X2 A 3 + a'(B$ X2 A 3 ) 
^ Ti (*t(^i^2,^3,xi,x 2 ) = Btj X2 A 3 + a(B$ X2 A 3 ). 

Here, a' and a, given in proposition 3.17 by using different metrics on 
X, are different. VlZ^i uses the standard metric while ^f' T uses the metric 
over X$ Xl S A . We know that 

\\da'\\ < Crl ,q \\d<S' Tl \\. 

We also have the same estimate for using the standard metric on 
X. Translate these estimates to the metric we are using now, then 

||da|| < C(eo)4 /q \\d^ Tl \\. 

To glue two maps together, one can do either of the following ways: 

1, deform the metric on X with respect to gluing parameter x such that 
the metric is standard one when \x\ > 3eo; 

2, deform the right inverse P's. To be precise, let P,P' be the right 
inverse used to define a, a' . Note that we have a simple fact:AP + (1 — 
X)P' is still a right inverse. With this, we can easily deform P's by 
cut-off functions. 
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If we choose t\ small enough, then r2 is small. By either way, can 
be deformed so that it matchs when \x\\ > 3eo- This glues ty' T and 
*Ti together, q.e.d. 

We have finished the proof of proposition 3.35. q.e.d. 

Remark 3.36. We make a remark on a compatibility property of gluing 
maps o,nd ^Ti ■ 



{^ T , Tl )*GL T GL Tl (f T , Tl ),GL^ GL Tl 



Ti 



y T — ^ Vti Vt ► Vt x 



In the first diagram, the map "?" does not agree with the transition 
map provided by the gluing theory, while in the second one, we know 
the transition map is the composition of three smooth maps. Therefore, 
the transition map is smooth. So, roughly speaking, the patching is the 
patching of gluing bundles by Bt,T\ up to ^t,t x ■ 

We can prove the main theorem in this section 

Theorem 3.37. Let K > be any integer. There is an atlas V(X, K) 
for Mr (X) such that the transition maps are C°° . So Mk(X)\Sk(X) 
is a smooth orbifold. 

Proof: The construction is based on the previous proposition. Recall 
that we defined a partial order on Tk- We introduce the charts for 
strata with respect to this order inductively and require that the atlas 
satisfies several requirements (Rl, R2, R3) 

Rl: To each stratum St only one chart {^t{Ut x GIt(^t)), ) is 
assigned, where Ut is a proper open subset of St- To simplify notations, 
we denote charts as chart(Vr,^T), where Vr = ^!t{UtxGIt{£t))- 
R2: For any two charts chart (Vt^t), chart(V T > ,~$>t'), Vt H V t ^ 
iff either T < T or T < T. 

We start with lowest strata which are compact. Choose a eo > such 
that for each T, (^t(St x GIt(go)^ ^y 1 ) & ves a coordinate chart for the 
neighborhood of St- Set ^t = *r- Moreover, for small eo, R2 can be 
satisfied. Let U C Mk{X) be the union of all open sets in given charts. 
This set is updated as charts are created. For any stratum T' whose 
chart has not been constructed, gluing maps of given charts induce a 
gluing map over Gt->T'\(ur\S T i) ( we simply say "over UC\Sr" later). The 
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statement is 

R3: There is a gluing map over U DSt induced by denned gluing 
maps of lower strata. 

This is defined as following: for any point (A, x), there exists a lower 
strata T < T' such that A = ^^/(B, x'). Then define the induced 
gluing map 

*£,(A,x) = V T (B,x',x). 

Note that may not be well defined since there may be more than one 
choice for T. To solve this, it is equivalent to require 
R3': For any two charts chart (V^ , ^Ti) and chart(VT 2 ,^T 2 ): *Ti 
agrees with ^t 2 on the overlap of their charts. 

There is no such a problem for the base case. Now suppose we are ready 
to give a chart for some T € Tk- This means that all strata lower than 
St already have charts. So St \hi is a proper set in St- By induction 
hypothesis R3, we already have a gluing map ^ T over M := hi n St 
induced from given charts. Now shrink M to M\ and then to M<i- Using 
^t over St\M2 and arguments for proposition 3.35, we glue ^t and ^f T 
together and define a new gluing map ^>t over GLt(£t) \ M2 such that 
it agrees with ^ T over M\. Here 6t is a small constant depending lower 
charts. Now modify the lower charts such that hi n St is M\ other than 
M. (This hi does not count current chart.) This will guarantee R3' and 
so R3. Of course, we can make 6t small enough to guarantee R2. Rl 
is automatically true. The transition maps are smooth as explained in 
proposition 3.35. The new atlas is denoted by V(X,K). q.e.d. 

R3 is the generalization of remark 3.36. We call it the compatibility 
property of the atlas T>(X, K). Because of this property, we can identify 
open set in every chart as a gluing bundle for corresponding stratum and 
they are patched together by bundle maps Bt,t> ■ This is very useful to 
our flip resolutions in §4.3. In fact, this was one of the main motivations 
of this section. 

4 Flip Resolution and Smooth Compact ifica- 
tion 

In the previous chapter, we constructed the compactified space M. via 
bubble tree compactification. Although certain smoothness and orb- 
ifold structures are achieved, there are still singularities wherever ghost 
strata occur. Suppose T € Gk is a ghost tree with g(T) ghost vertices. 
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The non-finite stabilizer is the product of g(T) copies of 50(3). Due to 
these obstructions, the advantage we can get from the bubble tree com- 
pactification, as compared to the Uhlenbeck compactification, is limited. 
Our task in this paper is to construct an orbifold compactification. The 
ingredient to solve this problem is the following key observation: Mt is 
obtained from some manifold by g{T) "blow-up" s. This structure is very 
similar to the Fulton-MacPherson (briefly indicated by FM) compacti- 
fied space. Using these "blowing-up" s to absorb the stabilizer 50(3)'s, 
we can apply " blowing-down" s and hence resolve singularities at St- 
We call this procedure the flip resolution. 

4.1 Flip Resolution and Smoothing M. K (X) for K = 2 

We start with the simplest case K = 2. For K = 2 we only need one flip 
resolution, so it is easy to see how the flip resolution works. For general 
K, we need to apply flip resolutions sequentially. We will describe this 
later. 

Suppose K = 2. The moduli space Mk{X) contains only one ghost 
stratum St where T is described as follows: the vertex set V of T is 
V = {^0)^1)^2,^3}- child(vo) = {«i}, child(vi) = {^2,^3} and v\ is a 
ghost vertex. w(v 2 ) = w(v 3 ) = 1. R(T) = [K - 2[0[1, 1]]]. 

Let STX be the sphere bundle of TX with the standard Z2 action 
fiber- wisely and 

Z t = Mk-2{X) xX. 

Then 

Lemma 4.1. St = Mk-2{X) x STX/Z 2 is a bundle over Zr- The 
gluing bundle GL^ is the bundle with fiber 

Gl T = Ro/%2 x Rf/Z 2 x R%/Z 2 - 

T T = SO (3). The gluing data is GL T /T T . 

The lemma follows from example 3.4 and the description at the end 
of §3.2. We skip the proof. By the construction of GL^, we know that 
GL-r is a pull-back bundle, say GL^, over Zr- In other words, 

GLit = St x z T GLy. 

The neighborhood of the stratum is GL^/50(3). In particular, 50(3) 
only acts on GL^. It is worth to take a look at other strata around St 
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and see how they change after flip resolutions: let (xo, £1,2:2) denote 
the point in Gl T , set T = * T (1,0,0) and Ti = * r (0, 1, 1). S To is the 
stratum that contains two bubble points on X. Stx is the stratum that 
contains only one bubble point on X with energy 2. For simplicity, 
we assume that M.k-2{X) is a point. Then the compactification of 
<Sto is the real blow-up of X 2 along diagonal modulo Z2 action. The 
compactification of Sr 1 is still singular at St- They intersect at St- 

Definition 4.2. Suppose N is a compact space and U — > TV is a disk 
bundle. Let S — ► N be the sphere bundle ofU. The space U = [0,1] x S 
is called the semi-blow- up of U along N. We say that {0} x S is the 
semi-blow-up boundary ofU. Conversely, ifU = [0, 1] x S where S is a 
sphere bundle of some disk bundle U over N, U is call the semi-blow- 
down of U. 

This is a rather trivial definition, however it is the case corresponding 
to our general model. Now let us see how we apply semi-blow-up and 
semi-blow-down to G~Lt/ SO(3). To simplify the notations, we ignore 
the orbifold structures and replace R 4 /Z 2 & SO (3) by R 4 & SU(2). In 
general, it is easy to see that the construction still works for orbifold 
cases. The resolution consists of two steps: 

Step 1: semi-blow up GLy along St- We call the resultant space GL^. 
This space is described as follows: semi-blow up GL^ along Zt- The 
resultant bundle is 

(GL^) tt = Z T x([0, 1) x S 7 ) =: [0, 1) x SGL^. 

So 

GL T /SU{2) = [0,1) xS t x Zt {SG-L' t /SU{2)). 

Step 2: semi-blow down [0, 1) x St along Zt- Since St is a sphere 
bundle over Zt of some vector bundle, say St^, we can apply the semi- 
blow down and replace [0, 1) x St by St d- Eventually, GL<r is replaced 
by 

GLt = S T4 x Zt (SG\, t /SU(2)). 

Clearly, none of points has non-finite stablizer in GLy. The procedure 
here is similar to the flip in algebraic geometry. So we call such an 
operation flip resolution. The following is the formal definition. 

Definition 4.3. Let G be a group acting on S 11 ^ 1 freely and it induces 
an action on R n . Let SN,SV be sphere bundles of the vector bundles 
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N, V over X, where V is n-bundle. There is a natural diffeomorphism 
f : SN x x (V\X)/G -> N\X x x SV/G 

by 

f(n, [v]) = (\v\n,[^)). 

\v\ 

Replacing SN XxV/G by N Xx SV/G via f is called flip resolution. 
Treat X as 0-section of N , we callXxxSV/G = SV/G the exceptional 
divisor ofNxx SV/G. 

We denote the new space by M. K {X). So we have 

Lemma 4.4. A4 K (X) is a smooth orbifold. 

To have a better understanding of this new space, we explain what 
the completions of St and are in the new space. The exceptional 
divisor takes the place of the ghost stratum St- It is A4k-2{X) x 
Xx(S 11 / SU{2)). One can check that the completion of St intersects 
the exceptional divisor at Mk-2{X) x X x {[1,0,0]} = Mk-2{X) x X. 
So 

Lemma 4.5. St in M K (X) is Mk-2(X) x Sym 2 (X). 

One can also prove that the completion of Sfi is a smooth sub- 
orbifold in M. K (X) and it is 

M K - 2 (X) x Fr{X) so{i) x Ml 

where M. \ is obtained from m\ by flip resolution in a similar way. Note 
that original Ai 2 is also singular. 

4.2 FM-Compactification and Ghost Strata 

We just explained that how to apply the flip resolution to the moduli 
space when K = 2. As K gets bigger, the ghost strata are very com- 
plicated and interwoven. Fortunately, they basically follow the rules of 
the FM-compactification of configuration space F(X,n). 

Review of the FM-compactification. 

Suppose X is a smooth, compact complex manifold. Let X^ := X n /S n , 
where S n is the permutation group. This space is also denoted by 
Sym n (X). (In the construction is given for X n .) Define 

F(X,n) =X^\A, 
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where A is the big diagonal in X^ n K In general, for a tuple I = 
(ii, . . . , if;), 1 < i\ < ■ ■ ■ < ik = n define 

A/ = {(xi, ... ,x n ) e X n \x h = ■■■ = x ik }. 

Let s : X n — > X^ be the projection. Aj also stands for s(Aj), for 
simplicity. So A = UA^ji. In [11] a compactified space F(X, n) is 
given. The construction can be described as follows: 

1) (complex)-blow up X n along diagonal A( lj2) ...,n) and denote the new 
manifold by X™. Accordingly, all other A/ are changed after blow- 
up. We still use A/ for the updated diagonals. Note that diagonals 
A/, \I\ = n — 1, are disjoint in X™. 

2) Inductively, suppose XJ! is created. If k = 2, we are done. Else, 
one can also prove that diagonals A/, \I\ = k — 1, are disjoint in XJ^ by 
induction. Blow up X2 along all these diagonals and create X?-,. As 
a result, A/, \I\ = k — 2, are disjoint. 

3) The action of S n on X n has a natural extension to X^. Let 



F(X,n)=X2/S n . 

There is a projection map FM : F{X,n) — > X^. It is clear that 
reversing the blow-up process and blowing-down F(X, n) we retrieve 
X^ n K Intuitively, we introduce blow-ups wherever two or more points 
run into one another. 

When X is a real manifold, one can also construct F(X, n) using 
real blow-up. Here, we use "semi-blow-up" to form a "stratified" space 
F(X, n). Each stratum is isomorphic to some Mt- We explain the 
construction of F(X, n) for n < 3. It can be generized to all n. 

Example 4.6 (n = 2). The space X% is the semi-blow-up of X 2 along 
diagonal. And F(X,n) = X\jS2- Let (AT 2 ) a be the real blow-up of X 2 
along A. Then 

F(X,n) = (X 2 ) A /S 2 . 
So when n = 2, this coincides with FM-compactification via real blow- 



ups. X\ has two strata f^i and 2 .' ^i is the boundary B of F(X,2) 
and 0,2 = F(X,2). 

Example 4.7 (n = 3). AT| is the semi-blow-up of X along the diagonal 
A( 1)2; 3). It is a manifold with boundary B123. &123 : -B123 — » X = A( lj2)3 ) 
is a sphere bundle over X . We can describe points on b^ 2 \ {%) as follows: 
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suppose z G b 123 (x), z can be written as z = (x, [x±, X2, X3]), Xi G TX X 
such that 

x\ + X2 + x 3 = 0, x\ + x\ + x\ = 3. (15) 

In fact, z is treated as the limit of triples (x + txi,x + tx 2 ,x + tx 3 ) 
in X 3 \ A. [xi,X2,X3] is the equivalent class (xi,X2,x 3 )/ ~, where 
(xi,X2,xs) ~ (x[,X2,x' 3 ) when Xj = Axj, A > ([FM]). The coordinates 
of neighborhood of B\ can be identified as (x, [x±,X2, x 3 ], t). When t ^ 0, 
(x, [x\, X2, x 3 ], t) = (x + txi,x + tx 2 ,x + tx^) G X 3 . Usually we drop t 
if it is equal to 0. Now we blow up Xf along Aj, I = (1, 2), (1, 3), (2, 3) 
and get X%. Consider A( lj2 ). There is a natural normal bundle for 
A(i ; 2) \ B123 in X| \ B\2z '■ The fiber over point (x,x,y) is identified with 
normal directions given by {x + tx',x — tx',y),x G TX X . This bundle 
can be naturally extended over A( 12 ) PI i?i23- We describe this in the 
neighborhood of B '123. Given a point (x,[xi,xi,xs],t) = (x, x + txi, x + 
txi, x + txs) with t > 0, a point y G TX x+txl in the normal direction is 
of the form 




,=o(x + tx\ + tsy, x + tx 2 - tsy, tx 3 ). 



The path can be given in terms of our new coordinate system. One can 
show that it can be extended over t = 0. This shows the bundle converges 
to the normal vector o/A( 12 ) n -B123 in B123. Identify the neighborhood 
of An 2) as this normal bundle and we can apply the semi-blow-up along 
A(i,2)- The new boundary is denoted by B\2- This is a stratified space. 
We denote the coordinates for the smallest stratum B\2 fl -B123 in the 
following way: 

{x[xi[y, -y],x 3 ]). 

Note that x is followed by a bracket [xi[...],x 3 ] and x\ is followed by 
[y, —y] without comma. We explain the format of the notations: xi,x 3 G 
T X X, y,—y G T Xl (T x X). The geometric meaning is that there are two 
points (since the bracket after x contains two points) that run into x 
and cause a blow-up at x and then there are two points run into x\ and 
cause the second blow-up at x\ . Moreover, the coordinates satisfy 

2xi + x 3 = 0; 2x\ + x\ = 1; y 2 + {-yf = 1. 

Correspondingly, blow up along A( 2) 3), ^(1,3) an d create the other two 
boundaries and denote them by £?23,-Bi2- The resultant space is X 3 
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and F(X,3) = X2/S3. Totally, there are four strata 1 < i < 4 in 
the space. We describe the strata and formulate the representations of 
points in the strata as follows. 



We explain the notations of points: for [x,y,z], x,y,z G X 3 \ A and 
[x,y,z] = (x,y,z)/S 3 ; for [x[y,-y],z], x,z G X 2 \A. y,-y G TX X , this 
corresponds to a blow-up at x and described by x[y, —y\; for [x[y, z,w]] 
this means there is a blow-up at x and y,z,w G (TX X ) 3 \ A; Similarly 
[x[y[z, —z], w]] says there is a blow-up at x and y,w G (TX X ) 2 \ A, and 
then followed by a blow-up at y and z, —z G T(TX x ) y . 

In general, a point in F(X, n) is denoted by 



Any point y included in the bracket following a point x is called de- 
scendant of x. Two points in the same level, like xq,x\ in the example, 
are called siblings. By the construction, siblings should differ from each 
other. It is clear that the representation of points in same stratum has 
same format. So we also say it is the format of the stratum. There is a 
natural way to construct bubble trees in terms of the formats of strata. 

Example 4.8. We continue example 4-7. We intend to build bubble 
trees Ti such that Mt { = Suppose Tj G Tk- In- the construction, 
we do not care what the weight of the root vo is. vq is always assigned 
with X. Weights for other nontrivial vertices are all equal to 1 in the 
construction. Namely 



ill 

n 3 
n 4 



{{B 12 U B 23 U B13) n B 123 }/S 3 , [x, [y[z, -z],w}}; 
{B 123 \ (B 12 U B 23 U B 13 )}/S 3 , [x[y, z, w}} 
{{B 12 U B 23 U B13) \ ^123}/^, [x[y, -y},z}; 
F(X,S),[x,y,z] 



p= [s [- ••[■••]], xi[.. .],...]. 



i2(Ti) = [K-3[0[l,0[l,l]]]] 

R(T 2 ) = [K-3[0[l,l,l]]] 

R(T 3 ) = [K-3[l,0[l,l]]] 

R(T 4 ) = [K-3[l,l,l]} 
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and representations of bubble tree mainfolds of Ti are 

Y (dr 1 ) = [xi[x 2 ,x 3 [x 4 ,x 5 ]]] 

Y(d T2 ) = [xi[x 2 ,x 3 ,x 4 ]] 

Y{d Tz ) = [xi,x 2 [x 3 ,x 4 ]] 

Y(dr 4 ) = [xi,x 2 ,x 3 ] 

Following these examples, one can generalize the correspondences 
for n > 3. A general condition similar to (18) is the balanced condition 
(3) on ghost bubble: suppose {xi, . . . ,xj~} are points on ghost bubble 
S 4 , then 

k k 

Xi = 0, and \xi\ 2 = k. 
i=i i=i 

We now generalize the construction to the weighted cases. Let to = 
(wi, . . . , w n ) G Z n , Wi > and S m be the associated permutation group. 
Define XJJ to be the weighted space of X n . It is the collection of points 
(xi, . . . ,x n ) G X n with assigned weights Wi to Xj. Define F m (X,n) = 
Xl^ \ A. Similarly, we can define 

F m (X,n) = X^/Sto- 



Example 4.9. F m (X,3). 

X^ 3 : the key point is to choose a normal bundle and its sphere bundle. 
Define a sub-bundle N m C TX x x TX x x TX -> A (12i3) = X. For 
any x G X the fiber consists of 

3 

{(xi , x 2 , x 3 ) G {TX x f | ^2 WiXi = 0} 

i=i 

Take sphere bundle SN m of N m by imposing 

Wi\xi\ 2 + W 2 \x 2 \ 2 + 1^3 | 2 = ^1 + ^2 + ^3 

with WiXi = 0. This is the balanced condition. In general if there 
are {xi, . . . , x^} points with assigned weights w = (w±, . . . ,Wk), then 

k k 

WiXi = and Wi\xi\ 2 = Wi 
i=i i=i 
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We call this the balanced condition of weight w. These definitions are 
made to be consistent with (3). The neighborhood of An 2) 3) in X 3 \ 
An 2,3) can be identified with iV ro and so with (0, 1) x SN m . The weighted 
blow-up is to replace the neighborhood by [0,1) x S m . One can mimic 
example ^.7 and do (weighted) -blow-up of X| along A( 12 ) and so on. 
The choice of bundle and its sphere bundle is made to be consistent with 
balanced conditions. 

Weighted spaces are natural objects for our model and they are the 
generalization of FM-models. Given to = (w±, . . . ,w n ), a tree T is to- 
type if it has only n nontrivial vertices, say vi,... ,v n , with weights 
w(vi) = Wi and all vertices are ghosts except leaves. Let G m be the 
collection of all to-type bubble trees. One can show that 

Lemma 4.10. F n (X,n) is a stratified space. Moreover 

F m (X,n) = U TeGm M T . 

Now we know that F m (X, n) is obtained by a sequence of blow-ups. 
And example 4.7 also shows how to identify strata of F m (X, n) with Mt- 
If we are just looking for an orbifold compactification of F(X,n), obvi- 
ously our FM-space is a not a wise choice. The easiest one, of course, 
is l' 11 ' itself. Note that to go from F m (X, n) to X^ L \ we can reverse 
the blow-up process and do blow-downs. F m (X, n) is the right model to 
describe A4k(X). The redundancy of F w (X,n) comparing to j'"' sug- 
gests that we might simplify Mk{X) and get a better compactification. 

Moduli spaces with marked points and Ghost Strata 

The concept of marked points is very common in the theory of J- 
holomorphic curves. It is natural to introduce the same concepts to 
our case without additional difficulties. This is not essential in our 
compactification, however, they make the explanation simpler. 

Definition 4.11. Let to = (w±,... ,w n ). A marked point tv-instanton 
of X is an instanton A with an unordered tuple (x±, . . . ,x n ), x% E X 
and Xi ^ Xj,i ^ j. Moreover Xi are associated with charges Wi. Denote 
such an element by [A, (xi,... ,x n )]. 

For to = (wi, . . . , w n ) define 

■M-K,m = {[A, (xi, . . . , Xfc)] is a marked point 
to - instantonj^ <G Mk (X)} 
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where Kq = K — Yl w i- Basically, the bubble tree compactification 
and gluing theory can also be generalized to this case. We denote the 
compactified space by M.k,ko{X). We outline how these generalizations 
can be done: 

Bubble tree compactification of Mk,w Suppose there is a sequence of 
marked point instantons {[A n , (x n i, . . . , x n kW^ = i- Either bubble points 
of {A n } or points x satisfying 

limx ni = limxnj = x,i ^ j. 

are called the bubble points. We consider the bubble point of second 
type. Suppose p is a bubble point and linix n i = lim x n 2 = p. Take unit 
disk B p (l) but treat (A n , x n \, x n 2) as generalized connections. Using 



the construction of bubble tree compactification [19| such as dilating, 
transition etc., we get a marked point instanton on sphere and so on. It 
is easy to see that if we fix A n = A the compactification coincides with 
FM compactification of weighted cases. 

We can mimic §3.1 to define the compactified space which is consis- 
tent with the bubble tree compactification. Some definitions should be 
changed slightly: 

1. Marked point bubble trees: Besides the general notations we men- 
tioned in §3.1, Wi are assigned to marked points. W(v) is defined 
to be sum of Wi's assigned to v and W(vi) as defined in §3.1. Then 
definition 3..1 can be used to define marked point bubble trees. 

2. Marked point bubble space of a marked point bubble tree: suppose 
X is the bubble space assigned to the bubble tree. For each vertex 
v suppose the assigned manifold is X v . For all W{ assigned to v 
they define distinct marked points qi £ X v which are different 
from the bubble points on X v , and Wi are weights assigned to 
these points qi. 

3. Marked point bubble tree instanton: treat qi as part of generalized 
instanton on and use the definition 3.3. 

We can similarly define Mic,to(X) an d etc. The gluing theory can be 
extended to these spaces similarly. 

We now introduce some terminologies that are related to ghost 
strata. Given a ghost tree T, we suppose v\, . . . ,vi are all ghost vertices 
of T. If Vi satisfies W(vi) = mini<j<; W(vj), we call V{ to be an ends of 
T and W(vi) to be the energy of the end. Suppose T = (V, D) is ghost 
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tree and v is an end. Let d{v) be the set of vertices that are descendants 
of v and V = V \ d(v). T' = (V, D') is the subtree of T induced by 
V', moreover assign W(v) of T to v to make T' a marked point bubble 
tree. We say T' is obtained by cutting T at v. 

We now consider a typical ghost tree (T,v ) <G T^: child(vo) = 
{v±, . . . ,Vk} and Vi, 1 < i < are ghost; For each Vi child(vi) = 
{vn,... ,vu t } and Vij are leaves. We now describe the stratum St- 
Let 

h h 
XV := {w ,wi,... ,w k ) := (w(v ), ^ wfaj), . . . ,J2 w ( v kj)) 

3=1 3=1 

and K = Yli=i w i- ^ is clear that St is a fiber bundle over Mk,w{X). 
The fiber is contributed by two components. One component is bubble 
points on ghost bubbles. This can be described as follows: let it : 
Mk,xo{X) — ► F ro (X, k) be the projection map on marked points. Define 

k 

S T = ir*l[SN K , 

i=i 

= ( w ( v n)> • • • ) w ( v ik))- The other component comes from the moduli 
spaces on v^. Let 

i=i j=i 

and Z T = Me-, w (X)xZ t . Then 

5 T = St x M K , m (x) 2 T - 

Since SN m / are (almost) sphere bundles, St are product of k sphere 
bundles over certain base. This structure is the key to our flip resolu- 
tions. We explain how this can be done assuming that SN m / are indeed 
sphere bundles: first, note that The gluing bundle GLt whose fiber is 
GIt is a pull back bundle GL^ over Zt- In other words, it is trivial 
over fiber of St- The neighborhood of <St therefore is 

U{St) = S t x Zt GL' T /F T . 

For simplicity, we assume k = 1. Then Tt = SU(2) up to some finite 
group. The flip resolution on this model is exactly the same as what we 
did in §3.1. 



49 



4.3 Flip Resolutions on Mk(X) 



We already see how a single flip resolution can be applied to resolve 
singularities. But when K > 2, ghost strata are complicated. We need 
a systematic process to apply flips. The arrangement is given in the 
proof of the following theorem. K = 3 is a representative case to see 
how this works. So along each step in the proof, the case of K = 3 
is associated as an example. We introduce some notations for Ms(X) 
first: there are seven ghost strata. The trees are 

r 1 = [0[0[0[l,l],l]]]; T 2 = [0[0[1,1[1]]]]; 
T 3 = [0[0[1,2]]]; T 4 = [0[0[1,1,1]]]; 

T 5 = [p[l[0[l,l]]]]; T 6 = [0[0[1,1],1]]; 

r 7 = [i[o[i,i]]]. 

Other non-ghost trees are 

T i = [3]; To2 = [2[l]]; T 03 = [1[1, 1]]; 

T 4 = [l[2]]; r 05 = [l[l[l]]]; 

Tu = [0[T 0t ]}, 1 < i < 5; 

T 21 = [0[1, 2]]; T 22 = [0[1, 1[1]]]; T 23 = [0[1, 1, 1]]. 

Theorem 4.12. For any integer K > by finite steps of "flips", the 
singularities of Mk(X) can be resolved. Denote the new spaces by 
M K (X). 

Proof: We prove this by induction for both M K (X), M b K . The first 
nontrivial case is K = 2. This is proved in §4.1. 

Now consider general K. The resolutions are done inductively on 
the end energies m < K of ghost trees. 
1, m = 2. 

Pick up bubble trees in Tk which has at least one end with energy m = 2. 
Suppose T is such a tree with k such ends and v\,... ,Vk are these ghost 
vertices. Set V = (v±,... ,Vk). Let T\ be the k— marked bubble tree 
obtained by cutting off these ends from T. Let to = (K — 2k, 2, 2, . . . ,2). 
Then St 1: w C M. K,ro- As before 

St = St x s Tl:W 2 T - 

Here St is the product of sphere bundles SNi,l < i < k, over Zt, 
where the sphere structure of SNi is obtained from the ghost vertex V{. 
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Actually, this is true only for a proper subset U C Sri,™- For simplicity 
we take it as St 1>w right now. We write 

Gl T /T T = (Gl T , Vl x • • • x Gl T , Vk ) x Gl T -v =■ Gl T ,v x GZ T -v, 
where 

hi 

GL T>Vi = R 4 /Z 2 x ]J(R*/Z 2 ), 

3=1 

and GIt-v is defined in terms of (4.7). In this case ki = 2. Correspond- 
ingly, there are bundles GLyy and GLy_y over Zt such that 

GL T = {S T x Zt GL Ty x Zt GL T _ V }/(JJ50(3) W . x F T _ V ) 

i 

where \\ i SO(3) Vi x Tt-v = Tt- We can apply k flip resolutions simul- 
taneously to k-pairs (SNi, Gl Vi ). So the stabilizer for the new model is 
Tt-v- As we pointed out, the flip resolutions only apply to the proper 
set of each stratum. Namely, we apply the flip resolutions to each chart 
given in D(X,K). We need that resolutions are compatible on different 
charts which intersect. This compatibility is promised by the compati- 
bility condition R3' in §3.3. 

After doing flip resolutions, we update Tk'- 
[1], get rid of trees whose strata appear in St. 

[2], distribute the exceptional divisors to those corresponding trees. We 
now explain how to do this. Suppose T has only one end to be flipped. 
When the number of flip resolutions is more than one, the method is 
same. Suppose v is the corresponding ghost vertex. Let v-\ be the 
parent of v and child(v) = {v±, ... , Vk}- Then 

Gl T ,v = Ro/%2 x (Rf/Z 2 x ... x Ri/Z 2 ). 

Here Rq is the basic gluing parameter assigned to the edge (v-\,v) 
and R\ are assigned to (v,Vi). The assignment of trees to points in 
the exceptional divisor is determined by its coordinates in SGLt, v , the 
sphere in GIt,v, in a natural way: suppose the coordinate of the point 
is [xq, . . . , Xfc]. Here [*] is the equivalence class with respects to Z* 2 s 
and isotropy group SO(3) actions. If Xi ^ 0, i > 0, we contract T at 
its corresponding edge. The resultant tree is assigned to the point. We 
now illustrate this step for K = 3. The ghost strata to flip when m = 2 
are those of trees Ti, T5, T@, T7. Among them, only Sr l is compact. So 
for the other three strata, their charts only cover some proper sets. The 
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resolutions for these three strata only resolve the singularities on these 
proper sets. The resolution of the rest is done by the resolution on 
GL^ . They would patching together well because of the compatibility 
condition. T^ : Tq : Tj are replaced by certain exceptional divisors and 
they are no longer singular. The interesting one left is T\. There are 
two components in the exceptional divisor which are still singular. One 
is added to St 4 and the other component is added to St 3 (or St 2 )- 

Define the resulting space as M.k,2{X). 
2, m < K. 

Pick up bubble trees in Tk which has at least one end with energy m. 
Suppose T is such a tree. For simplicity, we assume T contains only one 
such end. For multiple ends the argument is same as what we do for 
m = 2. Suppose v is the ghost vertex. Let child(v) = {v\, . . . , Vk}, Wi = 
w(t(vi)), and to' = (wi,... ,Wk)- We make a further assumption that 
Vi are leaves. The reason we can make such an assumption is that by 
induction, M^., corresponding to t(vi), are completely resolved. So we 
can assume that M b w . is assigned to vertex V{. For instance, in our 
example St 2 can be treated as a submanifold of St 3 in this sense. By 
assumption ^ Wi = m. Set to = (K — m,m). Let T\ be the 1-marked 
point bubble tree obtained by cutting T at v. We still have sphere 
bundles St and Zt, which it is better to denote by Z_ T , over 5ti,w 
One important fact is that St is(!) a sphere bundle. Note that this 
is not true until we finish previous (m — l)-step filp resolutions which 
provide all necessary blow-downs. In fact, if not, there must exist some 
ghost stratum in lower level which has an end with energy less than m. 
But this is impossible. So we can continue the flip resolution to the 
neighborhood of St as before. By finite steps, all ghost trees should 
disappear from Tk- 

As an example, let us consider K = 3 again. The ghost strata to 
resolve for m = 3 are T2,T%,T4. As we said, T2 and T3 can be treated 
together to be one stratum. We still denote it by St 3 - For T4, originally 
St 4 is almost a sphere bundle but it is not a sphere bundle! To see 
this, it is -B123 in example 4.7 diagonals are replaced by the exceptional 
divisors of semi-blow-ups. Applying the flip resolutions to the step for 
m = 2, the model now corresponds to X|. these blow-ups are killed by 
blowing-down. So St 4 is now a sphere bundle. It actually corresponds 
to B123. Then we again apply flip resolutions to these strata. This 
completes the flip resolutions for K = 3. q.e.d. 

We now derive some consequences from our smooth compactified 
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moduli spaces. First consider the 50(3)-bundle M b ^ over M b K . After 

compactification, is a fibration over Mr, but the bundle structure 
fails exactly at where the ghost bubbles occur at the principal compo- 
nent. By exactly same constructions, the flip resolutions can be applied 
to both spaces. For greater convenience later on, we consider a bundle 
TZM b ^, which is a semi-blow-down of -M^-° x R + . Basically, we just 
add a R + factor to the fiber 50 (3) and make it to be R 4 /^- Note that 
this bundle plays a role as part of gluing parameter in M(X). So by 
the same construction given in theorem 4.12, one can construct smooth 
orbifold bundle TZM%° over M b K . The fiber is R A /Z 2 . Take its sphere 

bundle and call it M^. It might be more natural to resolve A4^° di- 
rectly. In fact, this gives the same bundle. As we said, the SO (4) action 
on R A induces an action on M b K and M.£. It is not difficult to check 
that the actions can be extended to all compactification spaces. So we 
have 

Corollary 4.13. M_f is an 50(3) -bundle over Ai b K . Moreover, the 
bundle is SO(4)-equivariant. 

We now consider some useful sub-orbifolds in M K (X) and their 
normal bundles. Let < I < K. We put all /-level strata together and 
denote the set by S l . Each component of 5', given by a partition t of /, 
is a sub-orbifold and denoted by S[. From the construction of M K (X), 
we have 

Corollary 4.14. Suppose t = (h, . . . ,lk) is a partition of I. In A4 K (X) 

k 

S{ = [M K -i{X) x (Fr k (X) x 50fc(4) l[Ml)]/S t . 

i=i 

Rs normal bundle, denoted by MS\, is 

k 

[(Mk-i(X) x Gp i xP fc ( ) x xk Fr k {X)) x 50fc(4)x50fc(3) l[KJU$f)]/S t . 

i=i 

In particular, when M.k-1 is a point. 

k 

S[ = [Fr k (X) x sok{4) J} Atfj/St. (16) 
i=i 
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and 



k 

AfS[ = [[J(P_j x x Fr{X)) x 50(4)x50(3) KM*?]/ St. (17) 
i=i 

Later, for the model of the wall-crossing formula, Mk-i is not a 
point. But we only take a neighborhood of a point. So the models we 
consider are products of with (19) and (20). Note that these two 
models are very nice. Essentially, they have product structures and so 
usually the computations are reduced to the case that t contains only 
one element. 



5 Kotschick-Morgan Conjecture on Donaldson 
Wall-crossing Formula 

In the second part of the paper we demonstrate how the new compact- 
ifications can be applied. As an example, we prove the KM-conjecture 
in this chapter. It turns out that the localization theory of equivari- 
ant cohomologies is very helpful to the wall-crossing formula when we 
working on smooth spaces. The similar situation to the Seiberg-Witten 
theory was studied by Cao-Zhou (||). Recall that the wall-crossing for- 
mula is expressed in terms of the link. In §5.1, we explain how links are 
related to the equvariant integration. Then in §5.2 & 5.3, we apply the 
localization formula to our new compactified moduli spaces and prove 
the conjecture. 



5.1 Review on Equivariant Theory and Localization For- 
mula 

In this section, we simply review the equivariant theory and concentrate 



on the localization formula. Readers are referred to [flj,[|2| and [15] for 
details. 

Suppose G is a compact Lie group and M is a topological space. Set 
Mq = EG Xq M, where EG -> BG is the universal G-bundle. The 
equivariant cohomology of M is defined by 

H* G (M) :=H*(M G ). 

Since EG x G M is a bundle over BG with fiber M, H G {M) is a mod- 
ule over H*(BG). In particular, H*{BS r ) = C[«],fl*(BSpin(4)) = 
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C[cr,cl] (see lemma 5.10). Suppose it : E — > M is a G-equivariant 
vector bundle. The equivariant Thorn class is denned to be the Thorn 
class of ttg '■ Eq — > Ma- 
li M is smooth, the equivariant de Rham theory is also available. 
Assume dimM = 2m. From now on, we only consider G = S 1 . Let 
A*{M) be the graded algebra of differential forms of M. Counting u as 
an element of degree 2, we grade polynomial ring A*(M)[«] as well. So 
a degree 2k polynomial is in the form 

7 = a u k + aiu k ' 1 H h a k . (18) 

We call aou k is the leading term of 7. Define 7r„i = a n , the 2n-form 
coefficient. Suppose V is the vector field on M generated by the S 1 
action. Define an operator dgi : A*(M)[u] — > j4*(M)[u] by 

dgi'j = (d - i v )j, 7 G A*(M)[u]. 

We say that 7 is an S 1 -equivariant form if 7 is d s i -closed. If 7 has 
degree 2k given in the form as in (21), it is equivariant iff 

iv a n = da n -i, 1 < n < k. 

These definitions can be easily generalized to A*(M)[u, u^ 1 ] where u _1 
has degree —2. 

We now explain the localization formula. From now on, we assume 
that all equivariant classes are represented by de Rham forms. Suppose 
F = U{Fi is the union of all components of stationary sets of the action. 
Let Ni be the S^-equivariant normal bundle of F{ in M and Oi be 
its equivariant Thorn classes. Then the equivariant Euler class of the 
bundle 2Vj is Ei = We endow M with an S 1 -invariant metric. 

Define a 1-form 9 on M - F such that 9{V) = l,9\ v ± = 0. Here V ± 
is the orthogonal complement of V in the tangent space. Then 9 is a 
connection on the principal bundle M — F — > (M — F) /S . Equivariant 
forms have some properties over M — F. We list two useful lemma here. 
By a direct computation, one has 

Lemma 5.1. Suppose 7 = fj, + fu is an equivariant two form, then 
r(a) = fi — 9 A df + fd8 is an S 1 -invariant form over M — F. 

A simple result for equivariant forms with general degree is also true 
(|J). We only need degree 2 forms in our application. 
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Lemma 5.2. If 7 is an equivariant form. Then 7[ m ] is exact outside 
F. 

Proof: Following the definition, one can check that 

l[m] = d {8 A 7[m-i] + A d6 A 7 [m _ 2 ] + A (c$) 2 A 7 [m _ 3] +■••). 
This proves the lemma. q.e.d. 
Theorem 5.3 (Atiyah-Bott). #7 G H* S1 (M), 

7 = £/ J- (19) 

Note that the degree of Ei is equal to the codimension of Fi and its 
leading term does not vanish. The right hand side of (22) makes sense 
(in A*(Fi)[u,u~ 1 ]). The same situation also holds when we restrict 0j 
over a neighborhood of Fi. 

Proof: Let Ui,U' i ,U i be ^-invariant tubular neighborhoods of Fi 
with relations Ui D U- D C/j . Suppose that 0j, 0^ are the Thorn classes 
of the normal bundles and they are supported in Ui , U- respectively. 
Moreover, we require that 



Let 7j = ip.^f. Compare 



£ / f = £/ 



to f M 7, we have 



E/ H^;-/7 = E/ 7(1-1)-/ 7, (20) 



note that q 1 — 1 are supported in M \ and are equal to —1 when 
in M \ thus 7' := ]T\ ^- A 9; - 7 is an well defined equivariant 
form which is supported away from all stationary sets. By lemma 5.2, 
j M 7' = 0. This implies that the right hand side of (23) is 0. So we 
prove (22). q.e.d. 

The Localization theorem also has a version for manifolds with boundary 
([14], |H). Suppose M is a manifold with boundary and dM n F = 0. 
We state the theorem for a special case that we need later. 
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Theorem 5.4. Suppose 7 = fj, + fu is an equivariant 2-form of M and 
r("f) is the one defined in lemma 5.1. Then 



f = W (21) 

JdM/S 1 t JF % &i 

where 2m = dim M, 2n, = dimFi . 

Proof: Let 7 = j m ~ 1 u. Consider J M 7. By the same argument used 
in the proof of theorem 5.3, one also has a form 7' supported outside F. 
Moreover, 7' = 7 near dM. It is sufficient to show 

7'=/ r m ~\ 7 ). 
M JdM/S 1 

By lemma 5.2, we know that 7' = dn for some n over M — F. Note that 
n is trivial over a neighborhood of F. And in a neighborhood of dM, 
one can show that 

f = 7 = ( |(0Ar m - 1 (7)). 
Therefore by Stokes theorem 



7'= / 9Ar m - L (j). 
M JdM 

Since r m ~ l {^/) is horizontal, applying integration along fiber to the right 
side, we show (24). q.e.d. 

In (?), the left hand side corresponds to the "link" in our application. 
Finally, we make a remark on F. If the stationary sets are orbifolds, 
these results are still true up to proper constants depending upon the 
orbifold structures. 



5.2 Reducible instantons 

Here we explain the local models of reducible connections in Ad K (X, A) . 
All the constructions and techniques are well described in ||. Here, we 
state most of results without giving proofs. Readers are referred to 
corresponding theories in Q. 

First assume the reducible connection [A] with respect to a is in the 
top stratum Aix(X, A). It is known that 

Lemma 5.5. The neighborhood of [A] in Mk(X, A) is diffeomorphic 
to a neighborhood Ua/^a of in C N /Fa, where N = —a 2 — 2 and S 1 
action is complex multiplication. 
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Proof: In general, the local model at [A] is given by the \1/ (0) for a 
r^-equivariant map 

* : H [A\ 

When = 1, iJ^j is 1-dimensional. By choosing a generic path A, 
1-dimension parameter t can be mapped onto Hyn ( p7| ). So the lemma 
follows, q.e.d. 

The universal bundle P can not be defined over U/S 1 x X. However, 
there is a bundle P — ► Ua x X with compatible Ta action such that P = 
P/r^ over U — {0}/S 1 . Our main strategy is to apply the equivariant 
theory over Ua- 

Lemma 5.6. For any £ € H2(X,Z), the restriction o//2(£) to Ua can 
be extended as an equivariant class + fu such that 

f\{0} = -^(«, s )> 

The proof is exactly the same as the one in §5.1.4 (0). Alternatively, 
one can construct r^-equivariant line bundle representing 2/2(£) over U. 
Ta acts weightedly on the line bundle (§5.2 in p|). Then the equivariant 
Chern class of the line bundle is 2/i(S) — (a, £)u. 

Now consider the reducible connection [A] with respect to a located 
in lower strata. Suppose r = (a 2 — pi(P))/4. The family of reducible 
connections in M. K (X, A) is [A] x Sym r (X). Let N = —a 2 — 2 and U 
is a neig hborhood of {0} in C^. When a 2 = -1, one has to consider 
the obstruction bundle. The localization formula can still be applied to 
the so called "virtual neighborhoods" ||. The main result, proposition 
5.9, is still true for this case. Now assume a 2 ^ —1. The local model of 
[A] x Sym r (X) can be written as (Ua x QL a ^ r )jY a- In general, QC a , r 
is rather complicated ( fllQ| , |]r8|| ). Given a E £ H2(X,'L), there is a line 
bundle — > X whose ci is the Poincare dual of £ in H 2 (X, Z). For 
a line bundle L — > X, L n /S n is a line bundle over Sym n (X). We denote 
the bundle by Sym n (L). Sym n (u) is the c\ of Sym n (LY,). Mimicking 
the construction in §7 of ||, we have 

Proposition 5.7. There is a T a- equivariant line bundle C a ^ r over U x 
GC a ,r which is isomorphic to £ Qj o <S> Sym r (L 2 ^) . C ajr /TA restricted to 
(U x gC a ,r/r A ) \ [A] x Syrrf(X) represents 2 / u(£). ' 
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5.3 The Proof of Kotschick-Morgan Conjecture 

In this section, we apply the localization formula given in theorem 5.4 
to prove the KM-conjecture. 

As we mentioned, all properties in M K (X, A) can be lifted to both 
M K (X, A) and M K {X, A) by tt*, i = 1, 2. Let U^ T = ^{Ua x QC^ r ). 
To apply theorem 5.4 to U a ^ r we first study the fixed loci of Ta and how 
/x(S)'s are equivariantly extended. 

Proposition 5.8. The fixed loci o/Ta inU a . r are disjoint smooth com- 
ponents F t := [A] x S_l(X), where t = (t\, ... ,ti) runs over all parti- 
tions of r. The pull-back bundle ir 2 C air is a Ta equivariant line bundle. 
Its equivariant chern class restricted to the fixed loci F t is {a,T,)u + 
2Sym r (oj) . 

Proof: We explain the first statement for the case r = 2. For r > 2, 
the phenomenon are same. 

We begin with the model in Mk{X, A). We explain how Ta acts 
in this model and show how the situation changes after applying flip 
resolutions. Let U\ := 7rj~ (U x QC a ^ r ). The partitions of r = 2 are 
ti = (1,1) and t2 = (2). The reducible connections are parameterized 
by two components Fi := [A] x Sym 2 (X) \ A and F 2 := [A] x Xx M\. 
Here Fi corresponds to the partition tj. Note that F\ intersects with F 2 
at ghost bubbles. We now clarify the situation around ghost bubbles. 
The ghost stratum can be identified as (U x STX)/Z2- The gluing 
parameter of this stratum is 

2 

Gl = Y[Rt/Z 2 . 

i=0 

The gluing bundle GL is the bundle over U x STX with fiber isomorphic 
to Gl. In fact, it is constructed by a pull-back bundle from X 

k*(M° k _ 2 {X) x M h f{S\X)) 2 xR+ xR + x R+. 

where tt : U x STX — > X is the projection map. Then the local model 
of the ghost stratum is U\ C GL/(Z2 x SO (3)). Ta acts only on the 
first factor of Gl. In general, it acts only on the gluing parameters 
associated to bubble points on X. Suppose (xq,xi,x 2 ) € Gl. The fixed 
loci of in U\ are those corresponding to (0, x±, x 2 ) and (x±, 0, 0). The 
first case follows from the action directly, the second case comes from the 
50(3) quotient. They are indeed the neighborhoods of ghost stratum 



59 



in F 2 , F±. This picture says that two different fixed loci F±, F 2 intersect 
at the ghost stratum. Clearly, this awkward situation is caused by the 
singularity at the ghost stratum. Now we explain how the situation is 
modified after flip resolutions. After the resolution, the local model is 

U = [U x (TX/Z 2 )xS ll /SU{2)]/T, 

here T is isomorphic to Ta up to some finite group. By the definition 
of flip resolutions, points in S n /SU(2) is [xo,xi,X2] where (xo,xi,x 2 ) £ 
S 11 and the equivalent relation is given by (xq, x\, x 2 ) ~ (c _1 xo, cx\, cx 2 ) 
for some c G SU{2). Now the fix loci of Ta are separated. They corre- 
spond to [0, x\, x 2 ] and [xo, 0, 0] respectively. On the exceptional divisor 
after flip, X x [xo, 0, 0] is just the natural closure of Sym 2 (X)\A. These 
two components are in S^., i = 1, 2. This proves the first statement. For 
the second part, we use the line bundle given in proposition 5.3 and pull 
it back to U a , r - One can show that the weight of T^-action on factor 
Ly, is trivial. Then the assertion about forms at fixed loci follows from 
the case of top stratum, q.e.d. 

So far, we have finished the study of equivariant forms over U a)T . We 
take the neighborhood of each fixed locus F t and denote it by U a ^ r ^. We 
will apply theorem 5.4 to this model. Here, 7 will be the equivariant 
extension of //(£). Usually, computing the equivariant Euler classes of 
normal bundles at fixed loci is highly nontrivial. We first recall some 
facts mentioned at the end of §4.3. These can be used to reduce the 
computations. Recall that if t = (ti, . . . , tk) is a partition of r, the 
corresponding fix locus is F t = S_ r t (X). Then its normal bundle is 

N t = U A xMSUX). 

Since these bundles have product structures when k > 1, we consider 
k = 1 first. Now the situation is: let t = (r), there is a bundle iVt 
over [A] x S t (X) = [A] x XxM*. The fiber is x c(SO(3)), where 
c(SO(3)) denotes the cone of SO(3). We now focus on the nontrivial 
part of the bundle: the subbundle N' t with fiber c(50(3)). Explicitly, 

N' t = [P a x M^.'°(S\X)]/SO(3). 

Also 

M? r :°(S 4 ,X) = Fr(X) x 50 (4) M^iS 4 ). 
By theorem 5.4, we have 
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Proposition 5.9. Suppose 9 is an equivariant form over IAq f . Take an 
equivariant neighborhood of F t in N i7 say £7 t . Then 

f r{9) = 2 I \- 9U (22) 

JdUi/s 1 J F t u N -(it - a) 2 +pi{r) 

where Pl (r) is the Pl of 50(3) -bundle M^'°(S A , X). 

Proof: The trouble is to deal with N' t . Without loss of generality, we 
assume that N = 0. Now iVt is an orbifold bundle with fiber isomorphic 
to R 4 /X 2 . We first make some assumptions so that the computation can 
be done under the best situation: (1) in general, this orbifold bundle is 
not a global quotient of some i? 4 -bundle. Let us assume that it can be 
identified as a global quotient for a moment. (2) Suppose both 50(3)- 
bundles P a , Mr'°(^ 4 , X) can be lifted to 5f/(2)-bundles, say Y 1 ,Y 2 . (3) 
Let Zi = Yi x 5^(2) C 2 . As we know, Z\ can be split as Z\ = L l J 2 ®L a 1 ^ 2 . 

1/2 1/2 

We assume that Z 2 can be also split as Z 2 = L 2 @L 2 where p±(r) = 
L\. With these assumptions, we consider the cone of Honisu(2){Zi, Z 2 ). 
This is actually a C 2 bundle, we denote it by Hom{Z\, Z 2 ). Then 

Hom{Z u Z 2 ) {L- 1 ' 2 L l J 2 ) (L^ 2 ® L^ 2 ). 

Ta = 5 1 acts on the bundle by (e l9 / 2 -,e~ l9 / 2 -). One can check that 
Hom(Zi, Z 2 ) gives a double cover of N t . However, this bundle ex- 
ists only when our assumptions hold. In general, we can pass to a 
quotient bundle of this bundle which always exists: let Go = 7* 2 x 
7L 2 act on Hom(Zi, Z 2 ) by multiplication. Then the quotient bundle 
Hom(Zi, Z 2 )/Go = (L^ 1 ® L 2 ) © (L a ® L 2 ) and the 5 1 action is given 
by (e ld ■ , e~ td ■) . There is a natural map 

i-.N^HomiZuZ^/Go 

which is a double cover. Now, applying the localization formula over 
Hom(Zi, Z 2 )/Go is legal. There is a small problem to deal with: 6 is not 
Go invariant, however we can replace 9 by the average of g*9,g € Go- 
This will not change the left hand side of (25). So we assume that 
9 can be reduced to Hom(Zi, Z 2 )/Gq. The equivariant Euler class of 
Hom(Z\, Z 2 )/Go is then 

(u — a + ci(L 2 ))(— u + a + c\(L 2 )) = -{u - a) 2 +pi(r). 

Counting the multiplicity of covering and apply the localization formula, 
we prove (25). q.e.d. 
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Now the main problem in the right hand side of (25) is to compute p\{r). 
To deal with this problem we use a trick that was essentially used in 



Suppose / : X — > BSO(A) is the map that induces Fr(X), i.e, 
f*ESO(4) = Fr(X). All associated bundles of Fr(X) can be obtained 
by /*• So M*{S 4 ,X) and M^ r :°{S 4 ,X) are the pull-backs of bundles 

GM^ 4 ))so(4) = ESO{A) x so(4) M^.(S A ) 

and 

CM f b ' (S 4 )) 5O(4) = ESO(A) x 5O(4) M^ (S 4 ). 

Moreover (Mr'°(S 4 ))so(4) is still an SO(3)-bundle over (Ml(S 4 )) so{4) . 
We denote the p\ of this bundle by p±. Then pi(r) = (f*)*pi- By def- 
inition, j>! is an equivariant cohomology of M^S 4 ). As it is suggested 



in [18], we work with Spin{4) = SU (2)l x SU {2)r equivariant cohomol- 
ogy rather than with SO(4) = SU(2) x% 2 SU(2). Working with rational 
coefficients, no information is lost, as evident from by this lemma. 



Lemma 5.10 ([|13|). Let cl,cr be the second Chern classes of SU(2)- 
bundles 

ESpin{A)/SU(2) R -> BSpin(A) and ESpin(A)/ SU{2) L -» BSpin(A) 

respectively, then H*(BSpin(A),Q) = Q[cl,cr]. 

If s : BSpin(4) — > BSO{4) is the classifying map for the natural 
50(4) bundle over BSPin(4) ande,pi E H 4 (BSO(4:)) are the universal 
Euler class and the universal Pontrjagin class, then s*(p\ + 2e) = — 4cr 
and s*(pi - 2e) = —Acl- 

Since pi(r) E ^ n(4) (M'(5 4 )), we have 

Corollary 5.11. pi(r) = ujq + ujlcl + urcr, where ojq E ij" 4 (A / f^(5' 4 )) 
and lol,ujr E #°(Air(5' 4 )). 

Now we are able to prove our main theorem. 

Theorem 5.12. KM-Conjecture is true. 

Proof: Suppose [A] is a reducible connection with respect to a and lies 
in level-r strata, i.e, r = (a 2 — pi(P))/4. Using the same notations as in 
proposition 5.7, the neighborhood of reducible family is U ajT . Let 7 be 
the equivariant extension of /2(E). So 7 = /2(E) + /u. Given a partition 
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t of r, we know 7 restricted to the fixed loci F t is Sym r (uj) + \ {a, Ti)u. 
Apply theorem 5.4 to this model by letting M = U a . r . It is not hard to 
see that 

r( 7 )=/2(£) + ±(a,£>d0. 

So use this information and plug r("/ d ) into the left hand side of (25). 
We get exactly Sp(a). By the Stokes theorem, one can easily show that 
5p(a) is the sum of 5 a ^ which is defined by 

So it is sufficient to prove the following statement about 5 a ^\ 8 a ^ is in 
the form 

r 

5 a , t :=J2^(r,d,X,t) q r -*a d ~ 2r - 2 \ 

i=0 

and as it indicates a% depend on r, dt and homotopy invariants e, a of 
X. 

First assume t = (r). Therefore, the right hand side of (25) is the 
constant term of it-series 

u Ar - 2 {r-a+{a,Y)) d - 1 



M r (S\x) -{u-af +pi(r) 



(23) 



where C(r) is some constant depending on r due to the orbifold struc- 
ture. Note that ir : Ai r (S 4 , X) — > X is a bundle with fiber M b r , we can 
apply the integration along fiber 7T* first. The contributions of forms 
in fiber direction come from p\{r). More precisely, they are terms con- 
taining pf r ~ 2 (r) and £>i r_1 (r). Since 7r* commutes with the pull-back 
(/*)*, we can apply integration along fiber at (M^.{S 4 )) SpinU) ^ TS ^ an d 
pass down to BSpin{4), then pull back by /*. By corollary 5.11, the 
integration over fiber operation 7r* at (Ai^S 4 )) SpinU) implies 

K*(Pi(r)) 2{r - 1] = A, and 7r,(p 1 (r)) 2r - 1 = B R c R + B L c L 

in H A (BSpin(4),Q), where A, Br, Bl are universal numbers depending 
on ujq,ujl,ujr. By pulling back, 

?r*(pi (r)) 2(r ~ 1} = A and 7r*(pi(r)) 2r - 1 = B R (2e + 3a) + 5 L (2e - 3a). 

With this fact, applying (26) we can verify the statement for 6r r \ (a). 
For general t, the statement follows from the product property. Thus 
we prove the conjecture, q.e.d. 
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